Improving Multi-Camera View Recommendation with Temporal and Camera Embedding

Lee, Kuan-Ying; Zhou, Qian; Nahrstedt, Klara

doi:10.1109/IE64880.2025.11130132

Citation Details

This content will become publicly available on June 23, 2026

Improving Multi-Camera View Recommendation with Temporal and Camera Embedding

Multi-camera systems are essential in movies, live broadcasts, and other media. The selection of the appropriate camera for every moment has a decisive impact on production quality and audience preferences. Learning-based multi-camera view recommendation frameworks have been explored to assist professionals in decision making. This work explores how two standard cinematography practices could be incorporated into the learning pipeline: (1) not staying on the same camera for too long and (2) introducing a scene from a wider shot and gradually progressing to narrower ones. In these regards, we incorporate (1) the duration of the displaying camera and (2) camera identity as temporal and camera embedding in a transformer architecture, thereby implicitly guiding the model to learn the two practices from professional-labeled data. Experiments show that the proposed framework outperforms the baseline by 14.68% in six-way classification accuracy. Ablation studies on different approaches to embedding the temporal and camera information further verify the efficacy of the framework. more »

Award ID(s):: 1900875 2106592

PAR ID:: 10635554

Author(s) / Creator(s):: Lee, Kuan-Ying; Zhou, Qian; Nahrstedt, Klara

Publisher / Repository:: IEEE

Date Published:: 2025-06-23

ISBN:: 979-8-3315-2358-9

Page Range / eLocation ID:: 1 to 5

Subject(s) / Keyword(s):: Transformer Temporal Information Filming Learning-based Framework Multi-camera System Audience Preferences Heuristic Cross-entropy Loss Film Production Binary Cross Entropy Binary Cross-entropy Loss Embedding Learning Video Editing Position Embedding Multilayer Perception Past Frames

Format(s):: Medium: X

Location:: Darmstadt, Germany

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 23, 2026
Conference Paper:
https://doi.org/10.1109/IE64880.2025.11130132

More Like this