Faster, Stronger, and More Interpretable: Massive Transformer Architectures for Vision-Language Tasks.

Tong Chen, Sicong Liu, Zhiran Chen, Wenyan Hu, Dachi Chen,Yuanxin Wang, Qi Lyu, Cindy X. Le,Wenping Wang

Adv. Artif. Intell. Mach. Learn.(2023)

引用 0|浏览1
暂无评分
关键词
massive transformer architectures,vision-language
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要