Publication - Transition-Independent Decentralized Markov Decision Processes

Authors: Becker, Raphen; Zilberstein, Shlomo; Lesser, Victor; Goldman, Claudia V.
Title: Transition-Independent Decentralized Markov Decision Processes
Abstract: There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of multi-agent systems is lacking. A recent complexity result, showing that solving decentralized MDPs is NEXP-hard, provides a partial explanation. To overcome this complexity barrier, we identify a general class of transition-independent decentralized MDPs that is widely applicable. The class consists of independent collaborating agents that are tied together through a global reward function that depends upon both of their histories. We present a novel algorithm for solving this class of problems and examine its properties. The result is the first effective technique to solve optimally a class of decentralized MDPs. This lays the foundation for further work in this area on both exact and approximate solutions.
Keywords: Coordination, Distributed MDP, MDP, Multi-Agent Systems, Planning
Publication: Proceedings of the Second International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 41 - 48
Location: Melbourne, Australia
Publisher: ACM Press
Date: July 2003
Sources: PDF: http://mas.cs.umass.edu/~raphen/Papers/AAMAS-03/p470-becker.pdf
PDF: /Documents/raphen/AAMAS-03/p470-becker.pdf
PDF: http://cs.umass.edu/~raphen/publications/p470-becker.pdf
Notes: Received BEST PAPER AWARD.
Reference: Becker, Raphen; Zilberstein, Shlomo; Lesser, Victor; Goldman, Claudia V.. Transition-Independent Decentralized Markov Decision Processes. Proceedings of the Second International Joint Conference on Autonomous Agents and Multi Agent Systems, ACM Press, pp. 41-48. July 2003. Received BEST PAPER AWARD.
bibtex:
@inproceedings{Becker-253,
  author    = "Raphen Becker and Shlomo Zilberstein and Victor
               Lesser and Claudia V. Goldman",
  title     = "{Transition-Independent Decentralized Markov
               Decision Processes}",
  booktitle = "Proceedings of the Second International Joint
               Conference on Autonomous Agents and Multi Agent
               Systems",
  publisher = "ACM Press",
  pages     = "41-48",
  month     = "July",
  year      = "2003",
  address   = "Melbourne, Australia",
  url       = "http://mas.cs.umass.edu/paper/253",
}