Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Jack W. Rae,Sebastian Borgeaud,Trevor Cai,Katie Millican,Jordan Hoffmann,Francis Song,John Aslanides,Sarah B. Henderson,Roman Ring,Susannah Young,Eliza Rutherford,Tom Hennigan,Jacob Menick,Albin Cassirer,Richard J. Powell,George van den Driessche,Lisa Anne Hendricks,Maribeth Rauh,Po-Sen Huang,Amelia Glaese,Johannes Welbl,Sumanth Dathathri,Saffron Huang,Jonathan Uesato,John W. Mellor,Irina Higgins,Antonia Creswell,Nat McAleese, Amy Wu,Erich Elsen,Siddhant M. Jayakumar,Elena Buchatskaya,David Budden, Esme Sutherland,Karen Simonyan, M. Paganini,Laurent Sifre, Lena Martens,Xiang Lorraine Li,Adhiguna Kuncoro,Aida Nematzadeh,Elena Gribovskaya,Domenic Donato,Angeliki Lazaridou, Michel Arthur,Jean-Baptiste Lespiau,Maria Tsimpoukelli,Nikolai Grigorev,Doug Fritz,Thibault Sottiaux, Mantas Pajarskas,Toby Pohlen, Zhongying Gong,Daniel Toyama,Cyprien de Masson d’Autume,Yujia Li,Tayfun Terzi,Vladimir Mikulik, I. Babuschkin,Aidan Clark,Diego de Las Casas,Aurelia Guy,Chris Jones,James T. Bradbury,Matthew S. Johnson,Blake A. Hechtman,Laura Weidinger,Iason Gabriel,William M. Isaac, Ed Lockhart,Simon Osindero,Laura Rimell,Chris Dyer,Oriol Vinyals,Kareem Ayoub,Jeff Stanway, Lorrayne Bennett,Demis Hassabis,Koray Kavukcuoglu,Geoffrey Irving
arXiv (Cornell University)(2021)
引用 278|浏览106
关键词
language models,training
AI 理解论文
溯源树
样例

生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要