MusicFace: Music-driven expressive singing face synthesis

Liu, Pengfei; Deng, Wenjin; Li, Hengda; Wang, Jintai; Zheng, Yinglin; Ding, Yiwei; Guo, Xiaohu; Zeng, Ming

doi:10.1007/s41095-023-0343-7

Citation Details

MusicFace: Music-driven expressive singing face synthesis

Abstract It remains an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music. In this paper, we present a method for this task with natural motions for the lips, facial expression, head pose, and eyes. Due to the coupling of mixed information for the human voice and backing music in common music audio signals, we design a decouple-and-fuse strategy to tackle the challenge. We first decompose the input music audio into a human voice stream and a backing music stream. Due to the implicit and complicated correlation between the two-stream input signals and the dynamics of the facial expressions, head motions, and eye states, we model their relationship with an attention scheme, where the effects of the two streams are fused seamlessly. Furthermore, to improve the expressivenes of the generated results, we decompose head movement generation in terms of speed and direction, and decompose eye state generation into short-term blinking and long-term eye closing, modeling them separately. We have also built a novel dataset, SingingFace, to support training and evaluation of models for this task, including future work on this topic. Extensive experiments and a user study show that our proposed method is capable of synthesizing vivid singing faces, qualitatively and quantitatively better than the prior state-of-the-art. more »

Award ID(s):: 2007661

PAR ID:: 10558869

Author(s) / Creator(s):: Liu, Pengfei; Deng, Wenjin; Li, Hengda; Wang, Jintai; Zheng, Yinglin; Ding, Yiwei; Guo, Xiaohu; Zeng, Ming

Publisher / Repository:: Springer

Date Published:: 2024-02-01

Journal Name:: Computational Visual Media

Volume:: 10

Issue:: 1

ISSN:: 2096-0433

Page Range / eLocation ID:: 119 to 136

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1007/s41095-023-0343-7

More Like this