MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation.

IEEE/RJS International Conference on Intelligent RObots and Systems (IROS)(2022)

引用 5|浏览25
暂无评分
摘要
Visual SLAM systems targeting static scenes have been developed with satisfactory accuracy and robustness. Dynamic 3D object tracking has then become a significant capability in visual SLAM with the requirement of understanding dynamic surroundings in various scenarios including autonomous driving, augmented and virtual reality. However, performing dynamic SLAM solely with monocular images remains a challenging problem due to the difficulty of associating dynamic features and estimating their positions. In this paper, we present MOTSLAM, a dynamic visual SLAM system with the monocular configuration that tracks both poses and bounding boxes of dynamic objects. MOTSLAM first performs multiple object tracking (MOT) with associated both 2D and 3D bounding box detection to create initial 3D objects. Then, neural-network-based monocular depth estimation is applied to fetch the depth of dynamic features. Finally, camera poses, object poses, and both static, as well as dynamic map points, are jointly optimized using a novel bundle adjustment. Our experiments on the KITTI dataset demonstrate that our system has reached best performance on both camera ego-motion and object tracking on monocular dynamic SLAM.
更多
查看译文
关键词
3D bounding box detection,asso-ciating dynamic features,autonomous driving,bounding boxes,dynamic 3D object tracking,dynamic map points,dynamic objects,dynamic SLAM solely,dynamic surroundings,dynamic visual SLAM system,initial 3D objects,monocular configuration,monocular images,MOT-assisted monocular dynamic SLAM,MOTSLAM,multiple object tracking,neural-network-based monocular depth estimation,robustness,satisfactory accuracy,significant capability,single-view depth estimation,static scenes,visual SLAM systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要