Predictability and Surprise in Large Generative Models
Deep Ganguli,Danny Hernandez,Liane Lovitt,Nova DasSarma,Tom Henighan,Andy Jones,Nicholas Joseph,Jackson Kernion,Ben Mann,Amanda Askell,Yuntao Bai,Anna Chen,Tom Conerly,Dawn Drain,Nelson Elhage,Sheer El Showk,Stanislav Fort,Zac Hatfield-Dodds,Scott Johnston,Shauna Kravec,Neel Nanda,Kamal Ndousse,Catherine Olsson,Daniela Amodei,Dario Amodei,Tom Brown,Jared Kaplan,Sam McCandlish,Chris Olah,Jack Clark 2022 ACM Conference on Fairness, Accountability, and Transparency(2022)
关键词
Model Interpretability,Interpretable Models,Machine Learning Interpretability
AI 理解论文
溯源树
样例
