Publication - An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration
Authors: | Zhang, X.Shelley; Yoon, S.; DiBona, P.; Appling, D.S.; Ding, L.; Doppa, J.R.; Greeny, D.; Guo, J.K.; Kuter, U.; Levine, G.; MacTavish, R.L.; McFarlane, D.; Michaelis, J.R.; Mostafa, Hala; Ontañón, S.; Parker, C.; Radhakrishnan, J.; Rebguns, A.; Shrestha, B.; Song, Z.; Trewhitt, E.B.; Zafar, Huzaifa; Zhang, Chongjie; Corkill, Daniel; DeJong, G.; Dietterich, T.G.; Kambhampati, S.; Lesser, Victor; et al | ||||
Title: | An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration | ||||
Abstract: | We present a novel ensemble architecture for learning problem-solving techniques from a very small number of expert solutions and demonstrate its effectiveness in a complex real-world domain. The key feature of our “Generalized Integrated Learning Architecture” (GILA) is a set of integrated learning and reasoning (ILR) components, coordinated by a central meta-reasoning executive (MRE). The ILRs are weakly coupled in the sense that all coordination happens through the MRE. Each ILR learns independently from a small number of expert demonstrations of a complex task. During the performance, each ILR proposes partial solutions to subproblems posed by the MRE, which are then selected from and pieced together by the MRE to produce a complete solution. We describe the application of this novel learning and problem-solving architecture to the domain of airspace management, where multiple requests for the use of airspaces need to be deconflicted, reconciled and managed automatically. Formal evaluations show that our system performs as well as or better than humans after learning from the same training data. Furthermore, GILA outperforms any individual ILR run in isolation, thus demonstrating the power of the ensemble architecture for learning and problem solving. | ||||
Keywords: | Learning | ||||
Publication: | ACM Transactions on Intelligent Systems and Technology (TIST), Vol: 3, Num: 4, Article 75, pp. 1 - 38 | ||||
Publisher: | ACM | ||||
Date: | September 2012 | ||||
Sources: |
PDF: /Documents/lesser/XZhang_TIST_a75.pdf |
||||
Reference: | Zhang, X.Shelley; Yoon, S.; DiBona, P.; Appling, D.S.; Ding, L.; Doppa, J.R.; Greeny, D.; Guo, J.K.; Kuter, U.; Levine, G.; MacTavish, R.L.; McFarlane, D.; Michaelis, J.R.; Mostafa, Hala; Ontañón, S.; Parker, C.; Radhakrishnan, J.; Rebguns, A.; Shrestha, B.; Song, Z.; Trewhitt, E.B.; Zafar, Huzaifa; Zhang, Chongjie; Corkill, Daniel; DeJong, G.; Dietterich, T.G.; Kambhampati, S.; Lesser, Victor; et al. An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration. ACM Transactions on Intelligent Systems and Technology (TIST), Volume 3, Number 4, Article 75, ACM, pp. 1-38. September 2012. | ||||
bibtex: | @article{Zhang-514, author = "X.Shelley Zhang and S. Yoon and P. DiBona and D.S. Appling and L. Ding and J.R. Doppa and D. Greeny and J.K. Guo and U. Kuter and G. Levine and R.L. MacTavish and D. McFarlane and J.R. Michaelis and Hala Mostafa and S. Ontañón and C. Parker and J. Radhakrishnan and A. Rebguns and B. Shrestha and Z. Song and E.B. Trewhitt and Huzaifa Zafar and Chongjie Zhang and Daniel Corkill and G. DeJong and T.G. Dietterich and S. Kambhampati and Victor Lesser and et al", title = "{An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration}", journal = "ACM Transactions on Intelligent Systems and Technology (TIST)", volume = "3", number = "4, Article 75", publisher = "ACM", pages = "1-38", month = "September", year = "2012", url = "http://mas.cs.umass.edu/paper/514", } |