Global Optimality of Elman-type RNN in the Mean-Field Regime

Andrea Agazzi,Jianfeng Lu, Sayan Mukherjee

arxiv(2023)

引用 1|浏览5
暂无评分
摘要
We analyze Elman-type Recurrent Reural Networks (RNNs) and their training in the mean-field regime. Specifically, we show convergence of gradient descent training dynamics of the RNN to the corresponding mean-field formulation in the large width limit. We also show that the fixed points of the limiting infinite-width dynamics are globally optimal, under some assumptions on the initialization of the weights. Our results establish optimality for feature-learning with wide RNNs in the mean-field regime
更多
查看译文
关键词
global optimality,elman-type,mean-field
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要