Publication - Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning
| Authors: | Zhu, Xiangbin; Zhang, Chongjie; Lesser, Victor | ||||
| Title: | Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning | ||||
| Abstract: | Coordinating multi-agent reinforcement learning provides a promising approach to scaling learning in large cooperative multi-agent systems. It allows agents to learn local decision policies based on their local observations and rewards, and, meanwhile, coordinates agents´ learning processes to ensure the global learning performance. One key question is that how coordination mechanisms impact learning algorithms so that agents´ learning processes are guided and coordinated. This paper presents a new shaping approach that effectively integrates coordination mechanisms into local learning processes. This shaping approach uses two-level agent organization structures and combines reward shaping and action shaping. The higher-level agents dynamically and periodically produce the shaping heuristic knowledge based on the learning status of the lower-level agents. The lower-level agents then uses this knowledge to coordinate their local learning processes with other agents. Experimental results show our approach effectively speeds up the convergence of multi-agent learning in large systems. | ||||
| Keywords: | Learning, Multi-Agent Systems, Organizational Control | ||||
| Publication: | Proc. of 2013 IEEE/WIC/ACM International Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Vol: 2, pp. 321 - 328 | ||||
| Location: | Atlanta, GA | ||||
| Publisher: | IEEE Computer Society Press | ||||
| Date: | 2013 | ||||
| Sources: |
PDF: /Documents/lesser/zhu_IAT13.pdf HTML: http://doi.ieeecomputersociety.org/10.1109/WI-IAT.2013.127 |
||||
| Reference: | Xiangbin Zhu, Chongjie Zhang, Victor Lesser (2013). "Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-agent Learning," Proc. of IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Vol. 2, pp. 321-328. [View Details]
|
||||
| bibtex: | @inproceedings{Zhu-523,
author = "Xiangbin Zhu and Chongjie Zhang and Victor Lesser",
title = "{Combining Dynamic Reward Shaping and Action
Shaping for Coordinating Multi-Agent Learning}",
booktitle = "Proc. of 2013 IEEE/WIC/ACM International
Conferences on Web Intelligence (WI) and
Intelligent Agent Technologies (IAT)",
volume = "2",
publisher = "IEEE Computer Society Press",
pages = "321-328",
year = "2013",
address = "Atlanta, GA",
url = "http://mas.cs.umass.edu/paper/523",
}
|
||||