A Neural Network Approach for High-Dimensional Optimal Control Applied to Multiagent Path Finding

IEEE Transactions on Control Systems Technology(2023)

引用 10|浏览40
暂无评分
摘要
We propose a neural network (NN) approach that yields approximate solutions for high-dimensional optimal control (OC) problems and demonstrate its effectiveness using examples from multiagent path finding. Our approach yields control in a feedback form, where the policy function is given by an NN. In particular, we fuse the Hamilton-Jacobi-Bellman (HJB) and Pontryagin maximum principle (PMP) approaches by parameterizing the value function with an NN. Our approach enables us to obtain approximately OCs in real time without having to solve an optimization problem. Once the policy function is trained, generating a control at a given space-time location takes milliseconds; in contrast, efficient nonlinear programming methods typically perform the same task in seconds. We train the NN offline using the objective function of the control problem and penalty terms that enforce the HJB equations. Therefore, our training algorithm does not involve data generated by another algorithm. By training on a distribution of initial states, we ensure the controls' optimality on a large portion of the state space. Our grid-free approach scales efficiently to dimensions where grids become impractical or infeasible. We apply our approach to several multiagent collision-avoidance problems in up to 150 dimensions. Furthermore, we empirically observe that the number of parameters in our approach scales linearly with the dimension of the control problem, thereby mitigating the curse of dimensionality.
更多
查看译文
关键词
Artificial neural networks,Trajectory,Costs,Aerospace electronics,Real-time systems,Optimal control,Optimization,Collision avoidance,Hamilton-Jacobi-Bellman (HJB) equation,high-dimensional control,multiagent,neural networks (NNs),optimal control (OC)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要