An RGB-D Descriptor for Object Classification

ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY(2022)

引用 0|浏览1
暂无评分
摘要
One of the main and active research areas in computer vision is the object detection which has various applications including image retrieval, video surveillance, robotics, etc. The main problem of object detection is, detecting instances of semantic objects of predefined classes (such as pedestrians, faces, or cars) in 2D images and videos. As 2D images of the objects include information about object appearance, most of the methods rely on pattern detection algorithms using appearance-based or feature-based techniques. Although the availability of 3D image data by using inexpensive depth cameras has made the problem more tractable, many researchers still tend to use similar concepts applied to the 2D instance problem. In this paper, we aim to develop a 3D descriptor that exploits the information in 3D data to address the many difficulties associated with object detection. This method adds depth information to Bag of Visual Words' feature extraction part which is a novel approach in the literature. The proposed 3D descriptor eliminates the disadvantages of brightness-based problems and improves the structure with depth information. This improvement gives better accuracy results compared to the original method providing a rational and useful method for 3D object detection.
更多
查看译文
关键词
3D descriptor, bag of visual words, computer vision, depth image, object detection, RGB-D
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要