NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MusicFace: Music-driven expressive singing face synthesis

https://doi.org/10.1007/s41095-023-0343-7

Liu, Pengfei; Deng, Wenjin; Li, Hengda; Wang, Jintai; Zheng, Yinglin; Ding, Yiwei; Guo, Xiaohu; Zeng, Ming (February 2024, Computational Visual Media)

Abstract It remains an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music. In this paper, we present a method for this task with natural motions for the lips, facial expression, head pose, and eyes. Due to the coupling of mixed information for the human voice and backing music in common music audio signals, we design a decouple-and-fuse strategy to tackle the challenge. We first decompose the input music audio into a human voice stream and a backing music stream. Due to the implicit and complicated correlation between the two-stream input signals and the dynamics of the facial expressions, head motions, and eye states, we model their relationship with an attention scheme, where the effects of the two streams are fused seamlessly. Furthermore, to improve the expressivenes of the generated results, we decompose head movement generation in terms of speed and direction, and decompose eye state generation into short-term blinking and long-term eye closing, modeling them separately. We have also built a novel dataset, SingingFace, to support training and evaluation of models for this task, including future work on this topic. Extensive experiments and a user study show that our proposed method is capable of synthesizing vivid singing faces, qualitatively and quantitatively better than the prior state-of-the-art.
more » « less
Full Text Available
Robust Ranking Explanations

Chen, Chao; Guo, Chenghua; Ma, Guixiang; Zeng, Ming; Zhang, Xi; Xie, Sihong (July 2023, Workshop on Interpretable Machine Learning in Healthcare at ICML 2023)

Robust explanations of machine learning models are critical to establish human trust in the models. Due to limited cognition capability, most humans can only interpret the top few salient features. It is critical to make top salient features robust to adversarial attacks, especially those against the more vulnerable gradient-based explanations. Existing defense measures robustness using lp norms, which have weaker protection power. We define explanation thickness for measuring salient features ranking stability, and derive tractable surrogate bounds of the thickness to design the R2ET algorithm to efficiently maximize the thickness and anchor top salient features. Theoretically, we prove a connection between R2ET and adversarial training. Experiments with a wide spectrum of network architectures and data modalities, including brain networks, demonstrate that R2ET attains higher explanation robustness under stealthy attacks while retaining accuracy.
more » « less
Full Text Available
Robust Ranking Explanations

Chen, Chao; Guo, Chenghua; Ma, Guixiang; Zeng, Ming; Zhang, Xi; Xie, Sihong (July 2023, Workshop on Interpretable Machine Learning in Healthcare at ICML 2023)

Full Text Available
The optimal beam-loading in two-bunch nonlinear plasma wakefield accelerators

https://doi.org/10.1088/1361-6587/ac6a10

Wang, Xiaoning; Gao, Jie; Su, Qianqian; Wang, Jia; Li, Dazhang; Zeng, Ming; Lu, Wei; Mori, Warren B; Joshi, Chan; An, Weiming (May 2022, Plasma Physics and Controlled Fusion)

Abstract Due to the highly nonlinear nature of the beam-loading, it is currently not possible to analytically determine the beam parameters needed in a two-bunch plasma wakefield accelerator for maintaining a low energy spread. Therefore in this paper, by using the Broyden–Fletcher–Goldfarb–Shanno algorithm for the parameter scanning with the code QuickPIC and the polynomial regression together with k -fold cross-validation method, we obtain two fitting formulas for calculating the parameters of tri-Gaussian electron beams when minimizing the energy spread based on the beam-loading effect in a nonlinear plasma wakefield accelerator. One formula allows the optimization of the normalized charge per unit length of a trailing beam to achieve the minimal energy spread, i.e. the optimal beam-loading. The other one directly gives the transformer ratio when the trailing beam achieves the optimal beam-loading. A simple scaling law for charges of drive beams and trailing beams is obtained from the fitting formula, which indicates that the optimal beam-loading is always achieved for a given charge ratio of the two beams when the length and separation of two beams and the plasma density are fixed. The formulas can also help obtain the optimal plasma densities for the maximum accelerated charge and the maximum acceleration efficiency under the optimal beam-loading respectively. These two fitting formulas will significantly enhance the efficiency for designing and optimizing a two-bunch plasma wakefield acceleration stage.
more » « less
Full Text Available
3D Talking Face with Personalized Pose Dynamics

https://doi.org/10.1109/TVCG.2021.3117484

Zhang, Chenxu; Ni, Saifeng; Fan, Zhipeng; Li, Hongbo; Zeng, Ming; Budagavi, Madhukar; Guo, Xiaohu (October 2021, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning

https://doi.org/10.1109/ICCV48922.2021.00384

Zhang, Chenxu; Zhao, Yifan; Huang, Yifei; Zeng, Ming; Ni, Saifeng; Budagavi, Madhukar; Guo, Xiaohu (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

Full Text Available
Carbon isotope effects in the artificial photosynthesis reactions catalyzed by nanostructured Co/CoO

https://doi.org/10.1016/j.cplett.2020.137731

Zeng, Ming; Kan, Zhe; Wang, Zibo; Shen, Mengyan (September 2020, Chemical Physics Letters)
null (Ed.)
Full Text Available
Delay Minimization for Massive MIMO Assisted Mobile Edge Computing

https://doi.org/10.1109/TVT.2020.2979434

Zeng, Ming; Hao, Wanming; Dobre, Octavia A.; Poor, H. Vincent (June 2020, IEEE Transactions on Vehicular Technology)

Full Text Available
Conversion of water and carbon dioxide into methanol with solar energy on Au/Co nanostructured surfaces

https://doi.org/10.1088/2053-1591/ab7d0e

Zhu, Qinghua; Wang, Cong; Ren, Haizhou; Zeng, Ming; Kan, Zhe; Wang, Zibo; Shen, Mengyan (March 2020, Materials Research Express)
null (Ed.)
Full Text Available
Low-cost visible-light photosynthesis of water and adsorbed carbon dioxide into long-chain hydrocarbons

https://doi.org/10.1016/j.cplett.2019.136985

Wang, Cong; Ren, Haizhou; Zeng, Ming; Zhu, Qinghua; Zhang, Qing; Kan, Zhe; Wang, Zibo; Shen, Mengyan; Thalavitiya Acharige, Mahesh J.; Ruths, Marina; et al (January 2020, Chemical Physics Letters)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records