MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence

Yao, Y.; Jafarian, Y.; Park, H.S.

doi:10.1109/ICCV.2019.00084

Citation Details

MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence

This paper presents MONET -- an end-to-end semi-supervised learning framework for a keypoint detector using multiview image streams. In particular, we consider general subjects such as non-human species where attaining a large scale annotated dataset is challenging. While multiview geometry can be used to self-supervise the unlabeled data, integrating the geometry into learning a keypoint detector is challenging due to representation mismatch. We address this mismatch by formulating a new differentiable representation of the epipolar constraint called epipolar divergence---a generalized distance from the epipolar lines to the corresponding keypoint distribution. Epipolar divergence characterizes when two view keypoint distributions produce zero reprojection error. We design a twin network that minimizes the epipolar divergence through stereo rectification that can significantly alleviate computational complexity and sampling aliasing in training. We demonstrate that our framework can localize customized keypoints of diverse species, e.g., humans, dogs, and monkeys. more »

Award ID(s):: 1846031

PAR ID:: 10159651

Author(s) / Creator(s):: Yao, Y.; Jafarian, Y.; Park, H.S.

Date Published:: 2019-01-01

Journal Name:: International conference on Computer Vision

ISSN:: 2473-9936

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICCV.2019.00084

More Like this