AENet: attention efficient network for cross-view image geo-localization

ELECTRONIC RESEARCH ARCHIVE(2023)

引用 0|浏览0
暂无评分
摘要
To address the problem that task-irrelevant objects such as cars, pedestrians and sky, will in-terfere with the extracted feature descriptors in cross-view image geo-localization, this paper proposes a novel method for cross-view image geo-localization, named as AENet. The method includes two main parts: an attention efficient network fusing channel and spatial attention mechanisms and a triplet loss function based on a multiple hard samples weighting strategy. In the first part, the EfficientNetV2 network is used to extract features from the images and preliminarily filter irrelevant features from the channel dimension, then the Triplet Attention layer is applied to further filter irrelevant features from the spatial dimension. In the second part, a multiple hard samples weighting strategy is proposed to en-hance the learning of hard samples. Experimental results show that our proposed method significantly outperforms the state-of-the-art method on two existing benchmark datasets.
更多
查看译文
关键词
cross-view image geo-localization,attention mechanism,filter,hard sample
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要