Image-to-video person re-identification using three-dimensional semantic appearance alignment and cross-modal interactive learning

Pattern Recognition (2022)

Abstract
• A deep image-to-video person re-identification pipeline with two modules is proposed to learn fine-grained and temporally invariant features.
• To address appearance misalignment, a 3D-SAA module is designed to semantically align different human body parts in the 3D surface space.
• To address modality misalignment, a CMIL module is developed to fuse the two modalities with an interactive similarity comparison mechanism.
• A multi-branch aggregation network in the 3D-SAA module is designed to weaken the influence of negligible body parts and backgrounds.
Keywords
Person re-identification, Cross-modal learning, Appearance alignment