검색결과 : 2건
No. | Article |
---|---|
1 |
Simulation-based uniform value function estimates of Markov decision processes Jain R, Varaiya PP SIAM Journal on Control and Optimization, 45(5), 1633, 2006 |
2 |
Some contributions to fixed-distribution learning theory Vidyasagar M, Kulkarni SR IEEE Transactions on Automatic Control, 45(2), 217, 2000 |