Publication - Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
Authors: | Zhang, Chongjie; and Lesser, Victor | ||||
Title: | Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs | ||||
Abstract: | In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively un- der uncertainty and local interactions. Networked Distributed POMDP (ND-POMDP) provides a framework to model such cooperative multi-agent decision making. Existing work on ND-POMDPs has focused on offline techniques that require accurate models, which are usu- ally costly to obtain in practice. This paper presents a model-free, scalable learning approach that synthesizes multi-agent reinforcement learning (MARL) and dis- tributed constraint optimization (DCOP). By exploiting structured interaction in ND-POMDPs, our approach distributes the learning of the joint policy and employs DCOP techniques to coordinate distributed learning to ensure the global learning performance. Our approach can learn a globally optimal policy for ND-POMDPs with a property called groupwise observability. Exper- imental results show that, with communication during learning and execution, our approach significantly out- performs the nearly-optimal non-communication poli- cies computed offline. | ||||
Keywords: | Communication, Coordination, Distributed AI, Distributed MDP, Learning, Multi-Agent Systems, Uncertainty | ||||
Publication: | Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI-11), pp. 764 - 770 | ||||
Location: | San Francisco, California, USA | ||||
Date: | 2011 | ||||
Sources: |
PDF: /Documents/aaai11-zhang.pdf |
||||
Reference: | Zhang, Chongjie; and Lesser, Victor. Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs. Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI-11), pp. 764-770. 2011. | ||||
bibtex: | @inproceedings{Zhang-505, author = "Chongjie Zhang and Victor Lesser", title = "{Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs}", booktitle = "Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI-11)", pages = "764-770", year = "2011", address = "San Francisco, California, USA", url = "http://mas.cs.umass.edu/paper/505", } |