Rethink reporting of evaluation results in AIRyan Burnell,Wout Schellaert,John Burden,Tomer D. Ullman,Fernando Martinez-Plumed,Joshua B. Tenenbaum,Danaja Rutar,Lucy G. Cheke,Jascha Sohl-Dickstein,Melanie Mitchell,Douwe Kiela,Murray Shanahan,Ellen M. Voorhees,Anthony G. Cohn,Joel Z. Leibo,Jose Hernandez-OralloScience(2023)引用 23|浏览69暂无评分摘要Aggregate metrics and lack of access to results limit understanding.更多查看译文关键词evaluation results,ai,rethink reportingAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要