Hierarchical Policy Learning With Demonstration Learning for Robotic Multiple Peg-in-Hole Assembly Tasks

IEEE Transactions on Industrial Informatics (2023)

Abstract
Force-based control of robotic multiple peg-in-hole assembly remains challenging. To address the low adaptability of model-based control algorithms and the low learning efficiency of model-free control algorithms, a goal-based hierarchical policy learning (HPL) algorithm that combines a conventional control algorithm with a demonstration learning (DL) algorithm is proposed to learn the assembly skill. First, the goal-based HPL algorithm adds the goal as a new variable in the action-value function, and multiple states reached in each episode are randomly selected as subgoals to improve the distribution of positive rewards. Second, an initial policy that combines the conventional control algorithm and the DL algorithm is designed; the combination coefficient of the two algorithms is learned by the HPL algorithm. Finally, a conical surface is used to compute the forces and moments of a simplified assembly simulation model. The algorithm is implemented in both simulation and real-world environments, and the experimental results verify the effectiveness of the proposed method.
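The two learning ideas named in the abstract, relabeling transitions with subgoals drawn from states the episode actually reached and blending a conventional controller with a demonstration-learned policy through a coefficient, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `Transition`, `sparse_reward`, `relabel_with_subgoals`, and `blend_actions`, the tolerance-based sparse reward, and the convex-combination form of the blend are all assumptions made for illustration. In the paper the coefficient is learned by the HPL algorithm, whereas here it is simply passed in as `alpha`.

```python
import random
from typing import List, NamedTuple, Sequence, Tuple

Vector = Tuple[float, ...]


class Transition(NamedTuple):
    """One goal-conditioned step (hypothetical layout, assumed for this sketch)."""
    state: Vector
    action: Vector
    next_state: Vector
    goal: Vector
    reward: float


def sparse_reward(next_state: Vector, goal: Vector, tol: float = 1e-3) -> float:
    """Return 1.0 if next_state is within tol of goal on every axis, else 0.0."""
    reached = all(abs(s - g) <= tol for s, g in zip(next_state, goal))
    return 1.0 if reached else 0.0


def relabel_with_subgoals(
    episode: Sequence[Transition], k: int, rng: random.Random
) -> List[Transition]:
    """Create k relabeled copies of every transition in the episode.

    Each copy replaces the original goal with a state that the episode
    actually reached (sampled uniformly, with replacement) and recomputes
    the reward against that subgoal, so positive rewards appear even when
    the original goal was never achieved.
    """
    reached_states = [t.next_state for t in episode]
    relabeled: List[Transition] = []
    for t in episode:
        for subgoal in rng.choices(reached_states, k=k):
            relabeled.append(
                t._replace(goal=subgoal, reward=sparse_reward(t.next_state, subgoal))
            )
    return relabeled


def blend_actions(a_control: Vector, a_demo: Vector, alpha: float) -> Vector:
    """Convex combination of a controller action and a demonstration action.

    alpha = 1.0 uses only the conventional controller; alpha = 0.0 uses only
    the demonstration-learned policy. The convex form is an assumption.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return tuple(alpha * c + (1.0 - alpha) * d for c, d in zip(a_control, a_demo))
```

A replay buffer would store the relabeled transitions alongside the originals, and a goal-conditioned action-value function Q(s, a, g) would then be trained on both. Sampling subgoals from the whole episode rather than only from later steps is a simplification chosen here to match the abstract's wording.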
Keywords
Assembly model, demonstration learning (DL), force-based control algorithm, hierarchical reinforcement learning (HRL), peg-in-hole assembly