An Asymptotic Analysis of Random Partition Based Minibatch Momentum Methods for Linear Regression Models

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2023)

Abstract
Momentum methods have been shown, in both theory and practice, to accelerate the convergence of the standard gradient descent algorithm. In particular, random partition based minibatch gradient descent with momentum (MGDM) methods are widely used to solve large-scale optimization problems with massive datasets. Despite their great popularity in practice, the theoretical properties of MGDM methods remain underexplored. To this end, we investigate the theoretical properties of MGDM methods in the context of linear regression models. We first study the numerical convergence properties of the MGDM algorithm and derive conditions under which it attains a faster numerical convergence rate. In addition, we explore the relationship between the statistical properties of the resulting MGDM estimator and the tuning parameters. Based on these theoretical findings, we give conditions under which the resulting estimator achieves the optimal statistical efficiency. Finally, extensive numerical experiments are conducted to verify our theoretical results. Supplementary materials for this article are available online.
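The abstract does not spell out the MGDM update rule. The sketch below illustrates one standard form of a random partition based minibatch iteration with heavy-ball momentum on a least-squares objective; the function name `mgdm_linear_regression`, the hyperparameter values, and the mapping of the `shuffle` flag to the paper's "fixed minibatch" and "shuffled minibatch" schemes (suggested by the keywords) are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def mgdm_linear_regression(X, y, lr=0.01, momentum=0.9, batch_size=32,
                           n_epochs=100, shuffle=True, seed=0):
    """Minibatch gradient descent with heavy-ball momentum for
    least-squares linear regression.  A minimal sketch: shuffle=True
    draws a fresh random partition of the data each epoch ("shuffled
    minibatch"); shuffle=False reuses one fixed random partition
    ("fixed minibatch").  All defaults are illustrative, not from
    the paper."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    theta = np.zeros(p)        # regression coefficient estimate
    velocity = np.zeros(p)     # momentum buffer
    partition = rng.permutation(n)
    for _ in range(n_epochs):
        if shuffle:
            partition = rng.permutation(n)  # re-partition each epoch
        for start in range(0, n, batch_size):
            idx = partition[start:start + batch_size]
            Xb, yb = X[idx], y[idx]
            # gradient of the minibatch least-squares loss
            grad = Xb.T @ (Xb @ theta - yb) / len(idx)
            # heavy-ball update: accumulate momentum, then step
            velocity = momentum * velocity - lr * grad
            theta = theta + velocity
    return theta

# Usage: recover coefficients of a simulated linear model.
rng = np.random.default_rng(1)
X = rng.standard_normal((1000, 5))
beta = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ beta + 0.1 * rng.standard_normal(1000)
print(mgdm_linear_regression(X, y))
```

With momentum set to 0, the loop reduces to plain minibatch gradient descent, which makes the role of the momentum tuning parameter easy to probe numerically.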
Keywords
Fixed minibatch, Gradient descent, Momentum method, Numerical convergence rate, Shuffled minibatch, Statistical efficiency