Publication - Multi-Agent Learning with Policy Prediction

Authors:	Zhang, Chongjie; Lesser, Victor
Title:	Multi-Agent Learning with Policy Prediction
Abstract:	Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based algorithm by using policy prediction in basic gradient ascent. We prove that this modification results in a stronger notion of convergence than basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this modification, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms.
Keywords:	Learning, Multi-Agent Systems
Publication:	Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927 - 934
Location:	Atlanta
Date:	2010
Sources:	PDF: http://www.cs.umass.edu/~chongjie/pub/aaai2010_zhang.pdf PDF: /Documents/cjzhang/aaai10_zhang.pdf
Reference:	Zhang, Chongjie; Lesser, Victor. Multi-Agent Learning with Policy Prediction. Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927-934. 2010.
`bibtex`:	@inproceedings{Zhang-487, author = "Chongjie Zhang and Victor Lesser", title = "{Multi-Agent Learning with Policy Prediction}", booktitle = "Proceedings of the 24th AAAI Conference on Artificial Intelligence", pages = "927-934", year = "2010", address = "Atlanta", url = "http://mas.cs.umass.edu/paper/487", }