Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

Wang, Z; Shin, D; Fowlkes, C

Citation Details

Monocular estimation of 3d human pose has attracted in- creased attention with the availability of large ground-truth motion capture datasets. However, the diversity of training data available is limited and it is not clear to what extent methods generalize outside the specific datasets they are trained on. In this work we carry out a systematic study of the diversity and biases present in specific datasets and its e↵ect on cross-dataset generalization across a compendium of 5 pose datasets. We specifically focus on systematic di↵erences in the distri- bution of camera viewpoints relative to a body-centered coordinate frame. Based on this observation, we propose an auxiliary task of predicting the camera viewpoint in addition to pose. We find that models trained to jointly predict viewpoint and pose systematically show significantly improved cross-dataset generalization. more »

Award ID(s):: 1813785

NSF-PAR ID:: 10296118

Author(s) / Creator(s):: Wang, Z; Shin, D; Fowlkes, C

Date Published:: 2021-10-11

Journal Name:: IEEE International Conference on Computer Vision workshops

ISSN:: 2473-9936

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this