DROID: Learning from Offline Heterogeneous Demonstrations via Reward-Policy Distillation.

Sravan Jayanthi,Letian Chen, Nadya Balabanska, Van Duong, Erik Scarlatescu, Ezra Ameperosa,Zulfiqar Haider Zaidi,Daniel Martin, Taylor Keith Del Matto, Masahiro Ono,Matthew C. Gombolay

Conference on Robot Learning(2023)

引用 0|浏览3
暂无评分
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要