Scaling Text-Rich Image Understanding Via Code-Guided Synthetic Multimodal Data GenerationYue Yang,Ajay Patel,Matt Deitke,Tanmay Gupta,Luca Weihs,Andrew Head,Mark Yatskar,Chris Callison-Burch,Ranjay Krishna,Aniruddha Kembhavi,Christopher ClarkCoRR(2025)引用 0|浏览4AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要