Applied Mathematics and Optimization, Vol.37, No.3, 269-293, 1998
Adaptive control of a partially observed discrete time Markov process
An adaptive control problem of a discrete time Markov process that is completely observed in a fixed recurrent domain and is partially observed elsewhere is formulated and a solution is given by constructing an approximately self-optimal strategy. The State space of the Markov process is either a closed subset of Euclidean space or a countable set. Another adaptive control problem is solved where the process is always only partially observed but there is a family of random times when the process evaluated at these times is a family of independent, identically distributed random variables.
Keywords:SELF-OPTIMIZING STRATEGIES;SCHEME