A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

Lakshminarayanan C; Bhatnagar S; Szepesvari C

IEEE Transactions on Automatic Control, Vol.63, No.4, 1185-1191, 2018

DOI10.1109/TAC.2017.2743163 Export Citation

A Linearly Relaxed Approximate Linear Program for Markov Decision Processes

Lakshminarayanan C, Bhatnagar S, Szepesvari C

Approximate linear programming (ALP) and its variants have been widely applied to Markov decision processes (MDPs) with a large number of states. A serious limitation of ALP is that it has an intractable number of constraints, as a result of which constraint approximations are of interest. In this paper, we define a linearly relaxed approximation linear program (LRALP) that has a tractable number of constraints, obtained as positive linear combinations of the original constraints of the ALP. The main contribution is a novel performance bound for LRALP.

Keywords:Approximate linear programming (ALP);Markov decision processes (MDPs)