One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2016)

Abstract
One of the key challenges in applying reinforcement learning to complex robotic control tasks is the need to gather large amounts of experience in order to find an effective policy for the task at hand. Model-based reinforcement learning can achieve good sample efficiency, but requires the ability to learn a model of the dynamics that is good enough to learn an effective policy. In this work, we develop a model-based reinforcement learning algorithm that combines prior knowledge from previous tasks with online adaptation of the dynamics model. These two ingredients enable highly sample-efficient learning even in regimes where estimating the true dynamics is very difficult, since the online model adaptation allows the method to locally compensate for unmodeled variation in the dynamics. We encode the prior experience into a neural network dynamics model, adapt it online by progressively refitting a local linear model of the dynamics, and use model predictive control to plan under these dynamics. Our experimental results show that this approach can be used to solve a variety of complex robotic manipulation tasks in just a single attempt, using prior data from other manipulation behaviors.
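
The online adaptation step described above can be illustrated with a minimal sketch. This is not the paper's estimator: it assumes the prior (here a fixed matrix `prior_W`, standing in for whatever a learned neural network dynamics model would predict locally) is blended with recent transitions through ridge regression that shrinks toward the prior. The function name `refit_local_linear`, the parameter `prior_strength`, and the toy dynamics are all hypothetical and chosen only for illustration. Planning with model predictive control is not shown.

```python
import numpy as np

def refit_local_linear(states, actions, next_states, prior_W, prior_strength=1.0):
    """Fit x' ~ W @ [x, u, 1] from recent transitions, shrinking toward prior_W.

    Solves min_W sum_i ||x'_i - W z_i||^2 + prior_strength * ||W - prior_W||_F^2,
    an illustrative stand-in for combining a learned prior with online data.
    """
    n = states.shape[0]
    # Stack state, action, and a constant feature for the offset term.
    Z = np.hstack([states, actions, np.ones((n, 1))])   # shape (n, d)
    d = Z.shape[1]
    lhs = Z.T @ Z + prior_strength * np.eye(d)           # shape (d, d)
    rhs = Z.T @ next_states + prior_strength * prior_W.T # shape (d, dx)
    return np.linalg.solve(lhs, rhs).T                   # shape (dx, d)

# Toy ground-truth dynamics (assumed for the demo): x' = A x + B u + c.
rng = np.random.default_rng(0)
A_true = np.array([[1.0, 0.1], [0.0, 1.0]])
B_true = np.array([[0.0], [0.1]])
c_true = np.array([0.0, -0.05])
W_true = np.hstack([A_true, B_true, c_true[:, None]])

# A deliberately imperfect prior: "the state does not change".
prior_W = np.hstack([np.eye(2), np.zeros((2, 1)), np.zeros((2, 1))])

# Recent transitions observed during the current attempt.
X = rng.normal(size=(50, 2))
U = rng.normal(size=(50, 1))
X_next = X @ A_true.T + U @ B_true.T + c_true

# With data, the fit moves from the prior toward the observed dynamics.
W_fit = refit_local_linear(X, U, X_next, prior_W, prior_strength=1e-3)

# With no data yet, the fit falls back to the prior exactly.
W_empty = refit_local_linear(
    np.zeros((0, 2)), np.zeros((0, 1)), np.zeros((0, 2)), prior_W
)
```

In this sketch, a larger `prior_strength` keeps the local model closer to the prior when few transitions are available, while a smaller value lets the newest data dominate.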
Keywords
one-shot learning, online dynamics adaptation, manipulation skills, complex robotic control, model-based reinforcement learning algorithm, sample-efficient learning, online model adaptation, unmodeled variation compensation, neural network dynamics model, local linear model, model predictive control, complex robotic manipulation tasks