Publication - Multi-Agent Learning with Policy Prediction
Authors: | Zhang, Chongjie; Lesser, Victor | ||||
Title: | Multi-Agent Learning with Policy Prediction | ||||
Abstract: | Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based algorithm by using policy prediction in basic gradient ascent. We prove that this modification results in a stronger notion of convergence than basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this modification, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms. | ||||
Keywords: | Learning, Multi-Agent Systems | ||||
Publication: | Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927 - 934 | ||||
Location: | Atlanta | ||||
Date: | 2010 | ||||
Sources: |
PDF: http://www.cs.umass.edu/~chongjie/pub/aaai2010_zhang.pdf PDF: /Documents/cjzhang/aaai10_zhang.pdf |
||||
Reference: | Zhang, Chongjie; Lesser, Victor. Multi-Agent Learning with Policy Prediction. Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927-934. 2010. | ||||
bibtex: | @inproceedings{Zhang-487, author = "Chongjie Zhang and Victor Lesser", title = "{Multi-Agent Learning with Policy Prediction}", booktitle = "Proceedings of the 24th AAAI Conference on Artificial Intelligence", pages = "927-934", year = "2010", address = "Atlanta", url = "http://mas.cs.umass.edu/paper/487", } |