Seeing the Unseen: Predicting the First-Person Camera Wearer’s Location and Pose in Third-Person Scenes

Wen, Yangming; Singh, Krishna Kumar; Anderson, Markham; Jan, Wei-Pang; Lee, Yong Jae

doi:10.1109/iccvw54120.2021.00384

Citation Details

Seeing the Unseen: Predicting the First-Person Camera Wearer’s Location and Pose in Third-Person Scenes

Our goal is to predict the camera wearer’s location and pose in his/her environment based on what’s captured by the camera wearer’s first-person wearable camera. Toward this goal, we first collect a new dataset in which the camera wearer performs various activities (e.g., opening a fridge, reading a book) in different scenes with time-synchronized first-person and stationary third-person cameras. We then propose a novel deep network architecture, which takes as input the first-person video frames and empty third-person scene image (without the camera wearer) to predict the location and pose of the camera wearer. We explore and compare our approach with several intuitive baselines and show initial promising results on this novel, challenging problem. more »

Award ID(s):: 2204808

PAR ID:: 10385710

Author(s) / Creator(s):: Wen, Yangming; Singh, Krishna Kumar; Anderson, Markham; Jan, Wei-Pang; Lee, Yong Jae

Date Published:: 2021-10-01

Journal Name:: International Conference on Computer Vision Workshops (ICCVW)

Page Range / eLocation ID:: 3439 to 3448

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/iccvw54120.2021.00384

More Like this