Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021(2021)

引用 15|浏览41989
暂无评分
摘要
This paper introduces a new fundamental characteristic, i.e., the dynamic range, from real-world metric tools to deep visual recognition. In metrology, the dynamic range is a basic quality of a metric tool, indicating its flexibility to accommodate various scales. Larger dynamic range offers higher flexibility. In visual recognition, the multiple scale problem also exist. Different visual concepts may have different semantic scales. For example, "Animal" and "Plants" have a large semantic scale while "Elk" has a much smaller one. Under a small semantic scale, two different elks may look quite different to each other . However; under a large semantic scale (e.g., animals and plants), these two elks should be measured as being similar. Introducing the dynamic range to deep metric learning, we get a novel computer vision task, i.e., the Dynamic Metric Learning. It aims to learn a scalable metric space to accommodate visual concepts across multiple semantic scales. Based on three types of images, i.e., vehicle, animal and online products, we construct three datasets for Dynamic Metric Learning. We benchmark these datasets with popular deep metric learning methods and find Dynamic Metric Learning to be very challenging. The major difficulty lies in a conflict between different scales: the discriminative ability under a small scale usually compromises the discriminative ability under a large one, and vice versa. As a minor contribution, we propose Cross-Scale Learning (CSL) to alleviate such conflict. We show that CSL consistently improves the baseline on all the three datasets.
更多
查看译文
关键词
deep visual recognition,metric tool,larger dynamic range,multiple scale problem,semantic scales,semantic scale,dynamic metric learning,scalable metric space,popular deep metric learning methods,Cross-Scale Learning,accommodate multiple semantic scales,real-world metric tools,visual concepts
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要