Swin-TransUper: Swin Transformer-based UperNet for medical image segmentation

Multimedia Tools and Applications (2024)

Abstract
Convolutional neural network (CNN)-based UNet and its variants have shown remarkable performance in medical image segmentation. However, these methods capture only local features without spatial correlations and are incapable of global modeling. Previous studies have shown that both local and global features are critical in computer vision. Based on these considerations, this paper proposes a pure Transformer model named Swin-TransUper. First, we extend UperNet by incorporating the hierarchical Swin Transformer with shifted windows, thereby enhancing the global modeling capability of the model. Second, we introduce a Swin Pyramid Pooling Module (SPPM) that performs multi-scale feature mining on the deepest features generated by the encoder, making full use of their semantic information. Finally, a multi-scale attention module aggregates the multi-scale feature information to obtain a more refined feature map. Our method achieves state-of-the-art performance of 80.08 https://github.com/JianJianYin/Swin-TransUper .
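The shifted-window mechanism mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the Swin-TransUper repository: the function names `cyclic_shift` and `window_partition` are hypothetical, and a plain 2D list of integers stands in for a feature map. The sketch only shows how rolling the map by half a window before partitioning makes the new windows straddle the old window borders, which is what allows information to flow between neighbouring windows in consecutive layers.

```python
# Illustrative only: a toy version of regular vs. shifted window partitioning.
# Names and the list-of-lists "feature map" are assumptions for this sketch.

def cyclic_shift(grid, shift):
    """Roll a 2D grid up and to the left by `shift` cells, wrapping around."""
    h, w = len(grid), len(grid[0])
    return [
        [grid[(r + shift) % h][(c + shift) % w] for c in range(w)]
        for r in range(h)
    ]


def window_partition(grid, window):
    """Split an H x W grid into non-overlapping window x window blocks.

    Blocks are returned in row-major order; H and W must be multiples
    of `window`.
    """
    h, w = len(grid), len(grid[0])
    if h % window or w % window:
        raise ValueError("grid size must be divisible by the window size")
    return [
        [row[c0:c0 + window] for row in grid[r0:r0 + window]]
        for r0 in range(0, h, window)
        for c0 in range(0, w, window)
    ]


# A 4 x 4 map whose cells are numbered 0..15 in row-major order.
grid = [[r * 4 + c for c in range(4)] for r in range(4)]

window = 2
regular = window_partition(grid, window)
# Shift by half a window, as in the shifted-window scheme.
shifted = window_partition(cyclic_shift(grid, window // 2), window)
```

In this toy case the first regular window holds cells 0, 1, 4, 5, while the first shifted window holds cells 5, 6, 9, 10, which come from four different regular windows. The last shifted window wraps around and mixes cells from opposite corners of the map; the original Swin Transformer handles such wrapped windows with an attention mask, which this sketch omits.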
Keywords
Medical image segmentation, Swin Transformer, Swin-TransUper, UperNet, Convolutional neural network