An adaptive cross-scale transformer based on graph signal processing for person re-identification.

IET Image Process.(2023)

引用 0|浏览4
暂无评分
摘要
Extracting robust feature representation is one of the key challenges for person re-identification (ReID) task. Although convolution neural network (CNN)-based methods have achieved great success, they still cannot handle the part occlusion and misalignment caused by limited receptive field. Recently, pure transformer models have shown its power in the person ReID task. However, current transformer models adopt patches of equal-scale as input, and cannot solve the problem of cross-scale interaction properly. To overcome this problem, an adaptive cross-scale transformer from a perspective of the graph signal, named ACSFormer, is proposed. Specifically, the self-attention module is first treated as an undirected fully connected graph. And then, "node variation" is introduced as an indicator to adaptively merge neighbourhood tokens. To the best of the authors' knowledge, their ACSFormer is the first work to attempt to combine pure transformers and graph signal processing in the field of person ReID. Extensive evaluations are conducted on three person ReID datasets to validate the performance of ACSFormer. Experiments demonstrate that this ACSFormer performs on par with state-of-the-art CNN-based methods and consistently improves transformer-based baseline, for example, surpassing ViT-baseline by 2.5%, 2.7% and 4.8% mAP on Market1501, DukeMTMC-reID and MSMT17, respectively.
更多
查看译文
关键词
cross-scale interaction, graph signal processing, person re-identification, pure transformer model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要