CURIE: Evaluating LLMs on Multitask Scientific Long Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon, Xuejian Ma, Shutong Li, Maria Tikhanovskaya, Peter Norgaard, Nayantara Mudur, Martyna Plomecka, Paul Raccuglia,Yasaman Bahri, Victor V. Albert, Pranesh Srinivasan, Haining Pan, Philippe Faist, Brian Rohr, Michael Statt, Dan Morris, Drew Purves, Elise Kleeman, Ruth Alcantara, Matthew Abraham, Muqthar Mohammad, Ean VanLee, Chenfei Jiang, Elizabeth Dorfman, Eun-Ah Kim,Michael Brenner, Sameera Ponda,Subhashini Venugopalan ICLR 2025(2025)
AI 理解论文
溯源树
样例
