Publication - Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
| Authors: | Zhang, Chongjie; and Lesser, Victor | ||||
| Title: | Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs | ||||
| Abstract: | In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively un- der uncertainty and local interactions. Networked Distributed POMDP (ND-POMDP) provides a framework to model such cooperative multi-agent decision making. Existing work on ND-POMDPs has focused on offline techniques that require accurate models, which are usu- ally costly to obtain in practice. This paper presents a model-free, scalable learning approach that synthesizes multi-agent reinforcement learning (MARL) and dis- tributed constraint optimization (DCOP). By exploiting structured interaction in ND-POMDPs, our approach distributes the learning of the joint policy and employs DCOP techniques to coordinate distributed learning to ensure the global learning performance. Our approach can learn a globally optimal policy for ND-POMDPs with a property called groupwise observability. Exper- imental results show that, with communication during learning and execution, our approach significantly out- performs the nearly-optimal non-communication poli- cies computed offline. | ||||
| Keywords: | Communication, Coordination, Distributed AI, Distributed MDP, Learning, Multi-Agent Systems, Uncertainty | ||||
| Publication: | Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI-11), pp. 764 - 770 | ||||
| Location: | San Francisco, California, USA | ||||
| Date: | 2011 | ||||
| Sources: |
PDF: /Documents/aaai11-zhang.pdf |
||||
| Reference: | Zhang, Chongjie; and Lesser, Victor. Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs. Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI-11), pp. 764-770. 2011. | ||||
| bibtex: | @inproceedings{Zhang-505,
author = "Chongjie Zhang and Victor Lesser",
title = "{Coordinated Multi-Agent Reinforcement Learning in
Networked Distributed POMDPs}",
booktitle = "Proceedings of the Twenty-Fifth AAAI Conference on
Artificial Intelligence (AAAI-11)",
pages = "764-770",
year = "2011",
address = "San Francisco, California, USA",
url = "http://mas.cs.umass.edu/paper/505",
}
|
||||