Learning Correspondence from the Cycle-Consistency of Time

Wang, Xiaolong; Jabri, Allan; Efros, Alexei A.

Citation Details

We introduce a self-supervised method for learning visual correspondence from unlabeled video. The main idea is to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch. At training time, our model optimizes a spatial feature representation to be useful for performing cycle-consistent tracking. At test time, we use the acquired representation to find nearest neighbors across space and time. We demonstrate the generalizability of the representation across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow. Our approach outperforms previous self-supervised methods and performs competitively with strongly supervised methods. Overall, we find that the learned representation generalizes surprisingly well, despite being trained only on indoor videos and without fine-tuning. more »

Award ID(s):: 1633310

NSF-PAR ID:: 10111415

Author(s) / Creator(s):: Wang, Xiaolong; Jabri, Allan; Efros, Alexei A.

Date Published:: 2019-06-01

Journal Name:: IEEE Conference on Computer Vision and Pattern Recognition

ISSN:: 2163-6648

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this