BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks
Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi, Abhay Puri, Akshay Suresh, François Savard, Ahmed Masry, Shravan Nayak, Rabiul Awal, Mahsa Massoud, Amirhossein Abaskohi, Zichao Li, Suyuchen Wang, Pierre-André Noël, Mats L. Richter, Saverio Vadacchino, Shubham Agarwal, Sanket Biswas, Sara Shanian, Ying Zhang, Sathwik Tejaswi Madhusudhan, Joao Monteiro, Krishnamurthy Dvijotham, Torsten Scholak, Nicolas Chapados, Sepideh Kharaghani, Sean Hughes, M. Tamer Özsu, Siva Reddy, Marco Pedersoli, Yoshua Bengio, Christopher Pal, Issam Laradji, Spandana Gella, Perouz Taslakian, David Vazquez, Sai Rajeswar
ICLR 2025
