Deep representation of industrial components using simulated images.

ICRA(2017)

引用 5|浏览24
暂无评分
摘要
In this paper, we present a visual learning framework to retrieve a 3D model and estimate its pose from a single image. To increase the quantity and quality of training data, we define our simulation space in the near infrared (NIR) band, and utilize the quasi-Monte Carlo (MC) method for scalable photorealistic rendering of manufactured components. Two types of convolutional neural network (CNN) architectures are trained over these synthetic data and a relatively small amount of real data. The first CNN model seeks the most discriminative information and uses it to classify industrial components with fine-grained shape attributes. Once a 3D model is identified, one of the category-specific CNNs is tested for pose regression in the second phase. The mixed data for learning object categories is useful in domain adaptation and attention mechanism in our system. We validate our data-driven method with 88 component models, and the experimental results are qualitatively demonstrated. Also, the CNNs trained with various conditions of mixed data are quantitatively analyzed.
更多
查看译文
关键词
industrial components,simulated images,deep representation,visual learning framework,3D model retrieval,pose estimation,near infrared band,NIR band,quasiMonte Carlo method,quasiMC method,scalable photorealistic rendering,convolutional neural network architectures,CNN architectures,synthetic data,fine-grained shape attributes,category-specific CNN,pose regression,domain adaptation,attention mechanism,qualitative analysis,CNN training,quantitative analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要