Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints
arxiv(2024)
摘要
We present a multi-agent reinforcement learning approach to solve a
pursuit-evasion game between two players with car-like dynamics and sensing
limitations. We develop a curriculum for an existing multi-agent deterministic
policy gradient algorithm to simultaneously obtain strategies for both players,
and deploy the learned strategies on real robots moving as fast as 2 m/s in
indoor environments. Through experiments we show that the learned strategies
improve over existing baselines by up to 30
pursuer. The learned evader model has up to 5
baselines even against our competitive pursuer model. We also present
experiment results which show how the pursuit-evasion game and its results
evolve as the player dynamics and sensor constraints are varied. Finally, we
deploy learned policies on physical robots for a game between the F1TENTH and
JetRacer platforms and show that the learned strategies can be executed on
real-robots. Our code and supplementary material including videos from
experiments are available at https: //gonultasbu.github.io/pursuit-evasion/.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要