Vision Transformer with 2D Explicit Position Encoding
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Keywords
Deep learning, computer vision, vision transformer (ViT), explicit position encoding