Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.
Entropy (2022)
Keywords
correct proximal policy optimization, approximation theory, reinforcement learning, optimization, policy gradient, entropy