Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
NeurIPS 2023
Keywords
Implicit Bias, SGD Dynamics, Implicit regularization, Learning rate schedule, Stochastic Gradient Descent, Invariant set, Attractive saddle points, Stochastic collapse, Permutation invariance, Simplicity bias, Teacher-student