Special attribute-based cross-modal interactive fusion network for RGBT tracking

XIAOQIANG SHAO, Hao Li, Zhiyue Lu, Ma Bo, Liu ming qian, HAN ZeHui

crossref(2024)

Abstract
RGBT object tracking is robust to illumination changes and occlusion, and has therefore been widely used in video surveillance and automated driving. In this paper, we construct a tracking network in which the two modalities interact fully, guided by the challenge attributes present in infrared and visible images. The network consists of three parts: the special attribute fusion (SAF) module, the common attribute fusion (CAF) module, and the cross-modal interaction (CMI) module. The SAF module extracts the challenge attribute information that is unique to each modality, so that the network can exploit the strengths of each. The CAF module extracts features for the attributes shared by both modalities and aggregates them adaptively, assigning a weight to each challenge attribute to improve the tracker's adaptability. The CMI module handles interaction between the infrared and visible modalities, integrating the information common to both with the information specific to each, which improves the network's robustness. The proposed network is evaluated on the GTOT, RGBT234, and LasHeR datasets. The results show that our tracker outperforms the compared trackers, demonstrating the effectiveness of the method.
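The adaptive aggregation attributed to the CAF module can be illustrated with a small sketch. The abstract does not specify how the weights are computed, so everything here is an assumption made for illustration: the softmax gating, the function names `softmax` and `aggregate_attribute_features`, the attribute names, and the toy feature vectors are hypothetical and are not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_attribute_features(branch_features, gate_scores):
    """Fuse per-attribute feature vectors with softmax-normalized weights.

    branch_features: dict mapping a (hypothetical) attribute name to a
        feature vector, given as a list of floats of equal length.
    gate_scores: dict mapping the same names to unnormalized relevance
        scores; in a real network these would be predicted, here they
        are simply passed in.
    Returns the fused vector and the weight assigned to each attribute.
    """
    names = sorted(branch_features)
    weights = softmax([gate_scores[n] for n in names])
    dim = len(branch_features[names[0]])
    fused = [0.0] * dim
    for name, w in zip(names, weights):
        feat = branch_features[name]
        if len(feat) != dim:
            raise ValueError("all feature vectors must have the same length")
        for i, v in enumerate(feat):
            fused[i] += w * v
    return fused, dict(zip(names, weights))


# Toy usage: two assumed attribute branches with 2-dimensional features.
features = {"occlusion": [1.0, 0.0], "illumination": [0.0, 1.0]}
fused, weights = aggregate_attribute_features(
    features, {"occlusion": 2.0, "illumination": 0.0}
)
```

In this toy call the occlusion branch receives the larger score, so its features dominate the fused vector. Equal scores would reduce the fusion to a plain average of the branches.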