MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence

2019 IEEE/CVF International Conference on Computer Vision (ICCV)(2019)

引用 51|浏览4
暂无评分
摘要
This paper presents MONET-an end-to-end semi-supervised learning frameworkfor a keypoint detector using multiview image streams. In particular, we consider general subjects such as non-human species where attaining a large scale annotated dataset is challenging. While multiview geometry can be used to self-supervise the unlabeled data, integrating the geometry into learning a keypoint detector is challenging due to representation mismatch. We address this mismatch by formulating a new differentiable representation of the epipolar constraint called epipolar divergence-a generalized distance from the epipolar lines to the corresponding keypoint distribution. Epipolar divergence characterizes when two view keypoint distributions produce zero reprojection error. We design a twin network that minimizes the epipolar divergence through stereo rectification that can significantly alleviate computational complexity and sampling aliasing in training. We demonstrate that our framework can localize customized keypoints of diverse species, e.g., humans, dogs, and monkeys.
更多
查看译文
关键词
MONET-an end-to-end semisupervised learning,keypoint detector,multiview image streams,general subjects,nonhuman species,multiview geometry,representation mismatch,differentiable representation,epipolar constraint,epipolar divergence-a,epipolar lines,corresponding keypoint distribution,view keypoint distributions,semisupervised keypoint detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要