Distributed Reinforcement Learning via Gossip

Mathkar A; Borkar VS

IEEE Transactions on Automatic Control, Vol. 62, No. 3, pp. 1465-1470, 2017

DOI: 10.1109/TAC.2016.2585302

We consider the classical TD(0) algorithm implemented on a network of agents wherein the agents also incorporate updates received from neighboring agents using a gossip-like mechanism. The combined scheme is shown to converge for both discounted and average cost problems.
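
To make the combined scheme concrete, here is a minimal sketch of tabular TD(0) with a gossip averaging step for the discounted-cost case. It illustrates the general idea only and is not the authors' exact algorithm: the function name gossip_td0, the gossip weight matrix Q, the constant step size (the stochastic approximation setting would normally use a decreasing one), and the choice to compute each TD error from the agent's own pre-gossip estimate are all assumptions made for this example.

```python
import random


def gossip_td0(P, cost, Q, gamma=0.9, steps=5000, step_size=0.05, seed=0):
    """Tabular TD(0) on each agent, mixed with neighbours' estimates via Q.

    P    : transition matrix of the Markov chain (each row sums to 1)
    cost : per-state cost, cost[s]
    Q    : row-stochastic gossip weights (assumed); Q[i][j] > 0 only if
           agent j is a neighbour of agent i
    Returns one value estimate (list over states) per agent.
    """
    rng = random.Random(seed)
    n_agents, n_states = len(Q), len(P)
    theta = [[0.0] * n_states for _ in range(n_agents)]
    # Each agent simulates its own independent copy of the chain.
    state = [rng.randrange(n_states) for _ in range(n_agents)]

    for _ in range(steps):
        # Gossip step: weighted average of the neighbours' current estimates.
        mixed = [
            [sum(Q[i][j] * theta[j][s] for j in range(n_agents))
             for s in range(n_states)]
            for i in range(n_agents)
        ]
        for i in range(n_agents):
            s = state[i]
            s_next = rng.choices(range(n_states), weights=P[s])[0]
            # TD(0) error from agent i's own (pre-gossip) estimate.
            td_error = cost[s] + gamma * theta[i][s_next] - theta[i][s]
            mixed[i][s] += step_size * td_error
            state[i] = s_next
        theta = mixed
    return theta


# Hypothetical example: three fully connected agents, two-state chain.
P = [[0.9, 0.1], [0.2, 0.8]]
cost = [1.0, 0.0]
Q = [[0.5, 0.25, 0.25], [0.25, 0.5, 0.25], [0.25, 0.25, 0.5]]
estimates = gossip_td0(P, cost, Q)
```

Each agent's row in estimates is its own approximation of the discounted cost-to-go. With a constant step size the estimates keep fluctuating around the solution, so an actual implementation aiming at convergence would likely use a diminishing step-size schedule.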

Keywords: Distributed algorithm; gossip; reinforcement learning; stochastic approximation; TD(0)