DROID: Learning from Offline Heterogeneous Demonstrations via Reward-Policy Distillation.Sravan Jayanthi,Letian Chen, Nadya Balabanska, Van Duong, Erik Scarlatescu, Ezra Ameperosa,Zulfiqar Haider Zaidi,Daniel Martin, Taylor Keith Del Matto, Masahiro Ono,Matthew C. GombolayConference on Robot Learning(2023)引用 0|浏览3暂无评分AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要