Stochastic Online Optimization for Cyber-Physical and Robotic Systems
arxiv(2024)
摘要
We propose a novel gradient-based online optimization framework for solving
stochastic programming problems that frequently arise in the context of
cyber-physical and robotic systems. Our problem formulation accommodates
constraints that model the evolution of a cyber-physical system, which has, in
general, a continuous state and action space, is nonlinear, and where the state
is only partially observed. We also incorporate an approximate model of the
dynamics as prior knowledge into the learning process and show that even rough
estimates of the dynamics can significantly improve the convergence of our
algorithms. Our online optimization framework encompasses both gradient descent
and quasi-Newton methods, and we provide a unified convergence analysis of our
algorithms in a non-convex setting. We also characterize the impact of modeling
errors in the system dynamics on the convergence rate of the algorithms.
Finally, we evaluate our algorithms in simulations of a flexible beam, a
four-legged walking robot, and in real-world experiments with a ping-pong
playing robot.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要