Chrome Extension
WeChat Mini Program
Use on ChatGLM

Reinforcement Learning Generalization with Surprise Minimization

arXiv (Cornell University)(2020)

Cited 0|Views4
No score
Abstract
Generalization remains a challenging problem for reinforcement learning algorithms, which are often trained and tested on the same set of environments. When test environments are perturbed but the task is semantically the same, agents can still fail to perform accurately. Particularly when they are trained on high-dimensional state spaces, such as images. We evaluate an surprise minimizing agent on a generalization benchmark to show an additional reward learned from a density model can help agents acquire robust skills on unseen procedurally generated diverse environments.
More
Translated text
Key words
surprise minimization,reinforcement learning,generalization
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined