Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

Abstract
We focus on the task of amodal 3D object detection in RGB-D images, which aims to produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep Sliding Shapes, a 3D ConvNet formulation that takes a 3D volumetric scene from an RGB-D image as input and outputs 3D object bounding boxes. In our approach, we propose the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D and color features in 2D. In particular, we handle objects of various sizes by training an amodal RPN at two different scales and an ORN to regress 3D bounding boxes. Experiments show that our algorithm outperforms the state of the art by 13.8 in mAP and is 200× faster than the original Sliding Shapes.
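To make the phrase "3D volumetric scene from an RGB-D image" concrete, the following is a minimal sketch of how a depth map might be lifted into a voxel grid. It is not the paper's pipeline: the abstract does not say how the volume is encoded, so the pinhole camera model, the binary occupancy representation, and the names `depth_to_voxels`, `fx`, `fy`, `cx`, `cy`, and `voxel_size` are all assumptions made for illustration.

```python
import math
from typing import List, Set, Tuple

Voxel = Tuple[int, int, int]


def depth_to_voxels(
    depth: List[List[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    voxel_size: float,
) -> Set[Voxel]:
    """Back-project a depth map (meters, row-major) into occupied voxel indices.

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy). Binary occupancy is a simplification chosen for this
    sketch, not the encoding used in the paper.
    """
    occupied: Set[Voxel] = set()
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0.0:
                # Treat non-positive depth as a missing measurement.
                continue
            # Pinhole back-projection from pixel (u, v) to camera coordinates.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            # Quantize the metric point to an integer voxel index.
            occupied.add(
                (
                    math.floor(x / voxel_size),
                    math.floor(y / voxel_size),
                    math.floor(z / voxel_size),
                )
            )
    return occupied
```

Called as `depth_to_voxels(depth, fx, fy, cx, cy, voxel_size=0.1)`, the function returns the set of grid cells that contain at least one back-projected pixel. A 3D ConvNet would consume a dense array built from such cells, and the metric voxel size is what would let predicted boxes be expressed in real-world units.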
Keywords
deep sliding shapes, amodal 3D object detection, RGB-D images, 3D ConvNet formulation, 3D volumetric scene, 3D object bounding boxes, 3D region proposal network, RPN, geometric shapes, object recognition network, ORN, geometric feature extraction, color features, object learning, 3D convolutional neural networks