Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment

2023 9th International Conference on Control Science and Systems Engineering (ICCSSE)(2023)

引用 0|浏览3
暂无评分
摘要
In order to improve the convergence performance of soft actor-critic (SAC) algorithm in path planning problems, a delayed prioritized experience replay soft actor critic (DPERSAC) is proposed by designing a novel experience replay mechanism in a non-uniform manner for decreasing the convergence time. The path planning mathematical model is built for unmanned aerial vehicles (UAVs) subject to the flight performance constraints and obstacle avoidance constraints. Then the three typical elements of SAC are customized to satisfy the requirements of UAV's path planning. Differ from the traditional update manner that the soft Q-function network and policy network are updated recursively, the soft Q-function network is updated conditionally firstly and the policy network is subsequently iterated based on the trained soft Q-function in this paper. Finally, the Monte Carlo simulation results demonstrate that the computational time of the proposed DPERSAC method is only 4% of the rolling-based sparse A * algorithm in the dense obstacle environment.
更多
查看译文
关键词
Reinforcement Learning,Flight Path Planning,Soft Actor-Critic,Prioritized Experience Replay
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要