Local-to-Global Semantic Learning for Multi-View 3D Object Detection from Point Cloud

Renzhong Qiao,Hongbing Ji,Zhigang Zhu, Wenbo Zhang

IEEE Transactions on Circuits and Systems for Video Technology(2024)

引用 0|浏览0
暂无评分
摘要
LiDAR, as an excellent sensor, can provide positions, motion states, and other objective attribute information of objects in the 3D world. Inevitably, the inherent sparsity of point cloud and the problem of occlusion tend to cause incomplete semantic and geometry information of long-range small objects, posing challenges to 3D object detection. The multi-view models take advantage of the complementary information among bird’s eye view (BEV), range view (RV), and other views to alleviate the above issues. However, most of the existing methods coarsely learn the views’ features and neglect the learning of semantic information, which further leads to unsatisfactory detection performance. To this end, this paper proposes a Local-to-Global Semantic Learning Network (LGSLNet) for multi-view 3D object detection from point cloud. The proposed LGSLNet can effectively learn semantic information to explore the local semantics contained in various channels of RV features and to fuse them with BEV features. It has two branches with different backbones. In the BEV branch, the voxels quantized from the point cloud are extracted by sparse convolutional networks and compressed to BEV features. In the RV branch, a multi-scale backbone with semantic-aware convolution (SAC) is designed to learn the local semantic information of the RV. It allows for adaptation to the 3D location using the auxiliary network. In the fusion module, the bidirectional cross-view channel attention (Bi-CCA) is designed to compensate for the semantic information between multiple views and aggregate new RV and BEV features. Extensive experiments on the KITTI, ONCE, and nuScenes 3D object detection datasets demonstrate the superiority of our proposed method.
更多
查看译文
关键词
Multi-view 3D object detection,point cloud,local-to-global semantic learning,long-range small objects
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要