Distributed reinforcement learning for a traffic engineering application

Agents(2000)

引用 22|浏览0
暂无评分
摘要
In this paper, we report on novel reinforcement learning tech- niques applied to a real-world application. The problem do- main, a traffic engineering application, is formulated as a distributed reinforcement learning problem, where the re- turns of many agents are simultaneously updating a single shared policy. Learning occurs off-line in a traffic simulator, which allows us to retrieve and exploit good transient poli- cies even in the presence of instabilities in the learning. We introduce two new algorithms developed for this situation, one which is a value function based, and one that employs a direct policy evaluation approach. While the latter is the- oretically better motivated in several ways than the former, we find both perform comparably well in this domain and for the formulation we use.
更多
查看译文
关键词
traffic engineering application,artificial intelligence,value function,reinforcement learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要