Attributable Watermarking of Speech Generative Models

Cho, Yongbaek; Kim, Changhoon; Yang, Yezhou; Ren, Yi

doi:10.1109/ICASSP43922.2022.9746578


Generative models can now synthesize images, speech, and video that are hardly distinguishable from authentic content. Such capabilities raise concerns such as malicious impersonation and IP theft. This paper investigates a solution for model attribution, i.e., the classification of synthetic content by its source model via watermarks embedded in the content. Building on past success of model attribution in the image domain, we discuss algorithmic improvements for generating user-end speech models that empirically achieve high attribution accuracy while maintaining high generation quality. We show the tradeoff between attributability and generation quality under a variety of attacks on generated speech signals that attempt to remove the watermarks, and the feasibility of learning watermarks that are robust against these attacks.

Award ID(s): 2101052

PAR ID: 10349551

Author(s) / Creator(s): Cho, Yongbaek; Kim, Changhoon; Yang, Yezhou; Ren, Yi

Date Published: 2022-01-01

Journal Name: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing

ISSN: 2379-190X

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/ICASSP43922.2022.9746578
