Two-Level Actor-Critic Using Multiple Teachers

AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems(2023)

引用 0|浏览29
暂无评分
摘要
Reinforcement learning (RL) has been successful in a variety of domains ranging from solving difficult games like Go [10] and drug discovery [4]. Most of these domains are characterized by high-dimensional states and continuous action spaces. However, sample inefficiency is a major challenge when applying these algorithms to real-world tasks such as robotics and healthcare care [6]. To address improved sample efficiency, rather than forcing agents to learn from scratch, domain knowledge from humans or existing agents can be leveraged in various ways [3].
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要