ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization

arxiv(2023)

引用 5|浏览14
暂无评分
摘要
Efficient detection and description of geometric regions in images is a prerequisite in visual systems for localization and mapping. Such systems still rely on traditional hand-crafted methods for efficient generation of lightweight descriptors, a common limitation of the more powerful neural network models that come with high compute and specific hardware requirements. In this paper, we focus on the adaptations required by detection and description neural networks to enable their use in computationally limited platforms such as robots, mobile, and augmented reality devices. To that end, we investigate and adapt network quantization techniques to accelerate inference and enable its use on compute limited platforms. In addition, we revisit common practices in descriptor quantization and propose the use of a binary descriptor normalization layer, enabling the generation of distinctive binary descriptors with a constant number of ones. ZippyPoint, our efficient quantized network with binary descriptors, improves the network runtime speed, the descriptor matching speed, and the 3D model size, by at least an order of magnitude when compared to full-precision counterparts. These improvements come at a minor performance degradation as evaluated on the tasks of homography estimation, visual localization, and map-free visual relocalization. Code and models are available at https://github.com/menelaoskanakis/ZippyPoint.
更多
查看译文
关键词
3D model size,binary descriptor normalization layer,common limitation,common practices,computationally limited platforms,compute limited platforms,descriptor matching speed,descriptor quantization,distinctive binary descriptors,efficient quantized network,fast interest point detection,full-precision counterparts,geometric regions,high compute,lightweight descriptors,map-free visual relocalization,mixed precision discretization,network quantization techniques,network runtime speed,powerful neural network models,reality devices,specific hardware requirements,traditional handcrafted methods,visual localization,visual systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要