MOVES: Manipulated Objects in Video Enable Segmentation

Higgins, Richard E.L.; Fouhey, David F.

doi:10.1109/CVPR52729.2023.00613

Citation Details

MOVES: Manipulated Objects in Video Enable Segmentation

Our method uses manipulation in video to learn to understand held-objects and hand-object contact. We train a system that takes a single RGB image and produces a pixel-embedding that can be used to answer grouping questions (do these two pixels go together) as well as hand-association questions (is this hand holding that pixel). Rather than painstakingly annotate segmentation masks, we observe people in realistic video data. We show that pairing epipolar geometry with modern optical flow produces simple and effective pseudo-labels for grouping. Given people segmentations, we can further associate pixels with hands to understand contact. Our system achieves competitive results on hand and hand-held object tasks. more »

Award ID(s):: 2006619

PAR ID:: 10469274

Author(s) / Creator(s):: Higgins, Richard E.L.; Fouhey, David F.

Publisher / Repository:: CVPR

Date Published:: 2023-06-18

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/CVPR52729.2023.00613

More Like this