검색결과 : 4건
No. | Article |
---|---|
1 |
A stability criterion for two timescale stochastic approximation schemes Lakshminarayanan C, Bhatnagar S Automatica, 79, 108, 2017 |
2 |
Natural actor-critic algorithms Bhatnagar S, Sutton RS, Ghavamzadeh M, Lee M Automatica, 45(11), 2471, 2009 |
3 |
New algorithms of the Q-learning type Bhatnagar S, Babu KM Automatica, 44(4), 1111, 2008 |
4 |
A simultaneous perturbation Stochastic approximation-based actor-critic algorithm for Markov decision processes Bhatnagar S, Kumar S IEEE Transactions on Automatic Control, 49(4), 592, 2004 |