PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Xi Chen,Xiao Wang,Lucas Beyer,Alexander Kolesnikov,Jialin Wu,Paul Voigtlaender,Sebastian Goodman,Basil Mustafa,Ibrahim Alabdulmohsin,Piotr Padlewski,Daniel Salz,Xi Xiong,Daniel Vlasic,Filip Pavetic,Keran Rong,Tianli Yu,Daniel Keysers,Xiaohua Zhai,Radu Soricut ICLR 2024(2024)
关键词
Vision and Language,Multimodality,Contrastive Learning
AI 理解论文
溯源树
样例
