Pseudo Dataset Generation for Out-of-domain Multi-Camera View Recommendation

Lee, Kuan-Ying; Zhou, Qian; Nahrstedt, Klara

doi:10.1109/VCIP63160.2024.10849905

Citation Details

Pseudo Dataset Generation for Out-of-domain Multi-Camera View Recommendation

Multi-camera systems are indispensable in movies, TV shows, and other media. Selecting the appropriate camera at every timestamp has a decisive impact on production quality and audience preferences. Learning-based view recommendation frameworks can assist professionals in decision-making. However, they often struggle outside of their training domains. The scarcity of labeled multi-camera view recommendation datasets exacerbates the issue. Based on the insight that many videos are edited from the original multi-camera videos, we propose transforming regular videos into pseudo-labeled multi-camera view recommendation datasets. Promisingly, by training the model on pseudo-labeled datasets stemming from videos in the target domain, we achieve a 68% relative improvement in the model’s accuracy in the target domain and bridge the accuracy gap between in-domain and never-before-seen domains. more »

Award ID(s):: 1900875 2106592

PAR ID:: 10635556

Author(s) / Creator(s):: Lee, Kuan-Ying; Zhou, Qian; Nahrstedt, Klara

Publisher / Repository:: IEEE

Date Published:: 2024-12-08

ISBN:: 979-8-3315-2954-3

Page Range / eLocation ID:: 1 to 5

Subject(s) / Keyword(s):: Training Accuracy TV Visual communication Decision making Production Cinematography Media Motion pictures Videos

Format(s):: Medium: X

Location:: Tokyo, Japan

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/VCIP63160.2024.10849905

More Like this