Variational Autoencoder for 3D Voxel Compression

2020 35th International Conference on Image and Vision Computing New Zealand (IVCNZ)(2020)

引用 2|浏览20
暂无评分
摘要
3D scene sensing and understanding is a fundamental task in the field of computer vision and robotics. One widely used representation for 3D data is a voxel grid. However, explicit representation of 3D voxels always requires large storage space, which is not suitable for light-weight applications and scenarios such as robotic navigation and exploration. In this paper we propose a method to compress 3D voxel grids using an octree representation and Variational Autoencoders (VAEs). We first capture a 3D voxel grid -in our application with collaborating Realsense D435 and T265 cameras. The voxel grid is decomposed into three types of octants which are then compressed by the encoder and reproduced by feeding the latent code into the decoder. We demonstrate the efficiency of our method by two applications: scene reconstruction and path planning.
更多
查看译文
关键词
path planning,scene reconstruction,T265 cameras,Realsense D435,VAEs,robotic exploration,3D data,3D scene sensing,variational autoencoder,octree representation,3D voxel grids,robotic navigation,light-weight applications,explicit representation,computer vision,3D voxel compression
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要