Language Model Beats Diffusion - Tokenizer is Key to Visual Generation.Lijun Yu,Jose Lezama,Nitesh Bharadwaj Gundavarapu,Luca Versari,Kihyuk Sohn,David Minnen,Yong Cheng,Agrim Gupta,Xiuye Gu,Alexander G Hauptmann,Boqing Gong,Ming-Hsuan Yang,Irfan Essa,David A Ross,Lu JiangICLR 2024(2024)引用 305|浏览194关键词language model,diffusion model,video generation,visual tokenizationAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要