Publication - Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning

Authors:	Zhu, Xiangbin; Zhang, Chongjie; Lesser, Victor
Title:	Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning
Abstract:	Coordinating multi-agent reinforcement learning provides a promising approach to scaling learning in large cooperative multi-agent systems. It allows agents to learn local decision policies based on their local observations and rewards, and, meanwhile, coordinates agents´ learning processes to ensure the global learning performance. One key question is that how coordination mechanisms impact learning algorithms so that agents´ learning processes are guided and coordinated. This paper presents a new shaping approach that effectively integrates coordination mechanisms into local learning processes. This shaping approach uses two-level agent organization structures and combines reward shaping and action shaping. The higher-level agents dynamically and periodically produce the shaping heuristic knowledge based on the learning status of the lower-level agents. The lower-level agents then uses this knowledge to coordinate their local learning processes with other agents. Experimental results show our approach effectively speeds up the convergence of multi-agent learning in large systems.
Keywords:	Learning, Multi-Agent Systems, Organizational Control
Publication:	Proc. of 2013 IEEE/WIC/ACM International Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Vol: 2, pp. 321 - 328
Location:	Atlanta, GA
Publisher:	IEEE Computer Society Press
Date:	2013
Sources:	PDF: /Documents/lesser/zhu_IAT13.pdf HTML: http://doi.ieeecomputersociety.org/10.1109/WI-IAT.2013.127
Reference:	Xiangbin Zhu, Chongjie Zhang, Victor Lesser (2013). "Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-agent Learning," Proc. of IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Vol. 2, pp. 321-328. [View Details]
`bibtex`:	@inproceedings{Zhu-523, author = "Xiangbin Zhu and Chongjie Zhang and Victor Lesser", title = "{Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning}", booktitle = "Proc. of 2013 IEEE/WIC/ACM International Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)", volume = "2", publisher = "IEEE Computer Society Press", pages = "321-328", year = "2013", address = "Atlanta, GA", url = "http://mas.cs.umass.edu/paper/523", }