검색결과 : 2건
No. | Article |
---|---|
1 |
On actor-critic algorithms Konda VR, Tsitsiklis JN SIAM Journal on Control and Optimization, 42(4), 1143, 2003 |
2 |
Actor-critic-type learning algorithms for Markov decision processes Konda VR, Borkar VS SIAM Journal on Control and Optimization, 38(1), 94, 1999 |