Chemical Engineering and Materials Research Information Center (CHERIC)
Search results: 4 items
No. Article
1 A stability criterion for two timescale stochastic approximation schemes
Lakshminarayanan C, Bhatnagar S
Automatica, 79, 108, 2017
2 Natural actor-critic algorithms
Bhatnagar S, Sutton RS, Ghavamzadeh M, Lee M
Automatica, 45(11), 2471, 2009
3 New algorithms of the Q-learning type
Bhatnagar S, Babu KM
Automatica, 44(4), 1111, 2008
4 A simultaneous perturbation stochastic approximation-based actor-critic algorithm for Markov decision processes
Bhatnagar S, Kumar S
IEEE Transactions on Automatic Control, 49(4), 592, 2004