T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation
arxiv(2024)
摘要
Efficient tooth segmentation in three-dimensional (3D) imaging, critical for
orthodontic diagnosis, remains challenging due to noise, low contrast, and
artifacts in CBCT images. Both convolutional Neural Networks (CNNs) and
transformers have emerged as popular architectures for image segmentation.
However, their efficacy in handling long-range dependencies is limited due to
inherent locality or computational complexity. To address this issue, we
propose T-Mamba, integrating shared positional encoding and frequency-based
features into vision mamba, to address limitations in spatial position
preservation and feature enhancement in frequency domain. Besides, we also
design a gate selection unit to integrate two features in spatial domain and
one feature in frequency domain adaptively. T-Mamba is the first work to
introduce frequency-based features into vision mamba. Extensive experiments
demonstrate that T-Mamba achieves new SOTA results on the public Tooth CBCT
dataset and outperforms previous SOTA methods by a large margin, i.e., IoU +
3.63
models are publicly available at https://github.com/isbrycee/T-Mamba.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要