On-Policy Robot Imitation Learning from a Converging Supervisor.Ashwin Balakrishna,Brijen Thananjeyan,Jonathan Lee,Felix Li,Arsh Zahed,Joseph E. Gonzalez,Ken GoldbergConference on Robot Learning(2019)引用 14|浏览6关键词Imitation Learning,Online Learning,Reinforcement LearningAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要