Measuring Progress on Scalable Oversight for Large Language ModelsSamuel R. Bowman,Jeeyoon Hyun,Ethan Perez,Edwin Chen,Craig Pettit,Scott Heiner,Kamilė Lukošiūtė,Amanda Askell,Andy Jones,Anna Chen,Anna Goldie,Azalia Mirhoseini,Cameron McKinnon,Christopher Olah,Daniela Amodei,Dario Amodei,Dawn Drain,Dustin Li,Eli Tran-Johnson,Jackson Kernion,Jamie Kerr,Jared Mueller,Jeffrey Ladish,Joshua Landau,Kamal Ndousse,Liane Lovitt,Nelson Elhage,Nicholas Schiefer,Nicholas Joseph,Noemí Mercado,Nova DasSarma,Robin Larson,Sam McCandlish,Sandipan Kundu,Scott Johnston,Shauna Kravec,Sheer El Showk,Stanislav Fort,Timothy Telleen-Lawton,Tom Brown,Tom Henighan,Tristan Hume,Yuntao Bai,Zac Hatfield-Dodds,Ben Mann,Jared KaplanCoRR(2022)引用 53|浏览199关键词large language models,scalable oversight,language modelsAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要