Auto-learning communication reinforcement learning for multi-intersection traffic light control

Knowledge-Based Systems(2023)

引用 2|浏览9
暂无评分
摘要
Multi-agent reinforcement learning is a promising solution to achieve intelligent traffic light control by regarding each intersection as an independent agent. However, agents encounter partial observability and environmental instability issues when learning optimal strategies. To mitigate the impacts caused by the partial observability of cooperative agents, we propose the auto-learning communication reinforcement learning (ALCORL) method based on the advantage actor–critic algorithm. ALCORL enables intersections to communicate and enhance cooperation by receiving messages from adjacent intersections in multi-intersection scenarios. Specifically, the autoencoder is introduced into ALCORL to dynamically learn communication messages instead of defining specific communication regulations. Different from most studies that control the sequential conversion of phases to improve traffic conditions, we focus on regulating the phase duration directly and scheduling the traffic light time more flexibly. We conduct extensive experiments on different-scale datasets and ever-changing traffic conditions to verify the validity of ALCORL. The experimental results show that ALCORL performs better than several state-of-the-art algorithms in all evaluation metrics.
更多
查看译文
关键词
communication reinforcement auto-learning,reinforcement auto-learning,traffic,multi-intersection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要