Integrating reinforcement learning with human demonstrations of varying ability

Matthew E. Taylor,Halit Bener Suay,Sonia Chernova

AAMAS（2011）

引用 159|浏览23

暂无评分

摘要

This work introduces Human-Agent Transfer (HAT), an algorithm that combines transfer learning, learning from demonstration and reinforcement learning to achieve rapid learning and high performance in complex domains. Using experiments in a simulated robot soccer domain, we show that human demonstrations transferred into a baseline policy for an agent and refined using reinforcement learning significantly improve both learning time and policy performance. Our evaluation compares three algorithmic approaches to incorporating demonstration rule summaries into transfer learning, and studies the impact of demonstration quality and quantity, as well as the effect of combining demonstrations from multiple teachers. Our results show that all three transfer methods lead to statistically significant improvement in performance over learning without demonstration. The best performance was achieved by combining the best demonstrations from two teachers.

查看译文

关键词

demonstration quality,varying ability,best demonstration,rapid learning,demonstration rule summary,integrating reinforcement,high performance,policy performance,human demonstration,best performance,transfer learning,transfer method,reinforcement learning

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要