Publication - Multi-Agent Learning with Policy Prediction
| Authors: | Zhang, Chongjie; Lesser, Victor | ||||
| Title: | Multi-Agent Learning with Policy Prediction | ||||
| Abstract: | Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based algorithm by using policy prediction in basic gradient ascent. We prove that this modification results in a stronger notion of convergence than basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this modification, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms. | ||||
| Keywords: | Learning, Multi-Agent Systems | ||||
| Publication: | Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927 - 934 | ||||
| Location: | Atlanta | ||||
| Date: | 2010 | ||||
| Sources: | PDF: http://www.cs.umass.edu/~chongjie/pub/aaai2010_zhang.pdf PDF: /Documents/cjzhang/aaai10_zhang.pdf | ||||
| Reference: | Zhang, Chongjie; Lesser, Victor. Multi-Agent Learning with Policy Prediction. Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 927-934. 2010. | ||||
| bibtex: | @inproceedings{Zhang-487,
  author    = "Chongjie Zhang and Victor Lesser",
  title     = "{Multi-Agent Learning with Policy Prediction}",
  booktitle = "Proceedings of the 24th AAAI Conference on
               Artificial Intelligence",
  pages     = "927-934",
  year      = "2010",
  address   = "Atlanta",
  url       = "http://mas.cs.umass.edu/paper/487",
}
 | ||||
