Title: Adaptation and Frontend Features to Improve Naturalness in Found-Data Synthesis
We compare two approaches for training statistical parametric voices that make use of acoustic and prosodic features at the utterance level, with the aim of improving the naturalness of the resulting voices: subset adaptation, and adding new acoustic and prosodic features at the frontend. We find that labeling high, middle, or low values for a given feature at the frontend, and then choosing which setting to use at synthesis time, can produce voices rated as significantly more natural than a baseline voice that uses only the standard contextual frontend features, for both HMM-based and neural network-based synthesis.
Award ID(s): 1717680
PAR ID: 10097219
Author(s) / Creator(s):
Date Published: 2018
Journal Name: Speech Prosody 2018
Volume: 1
Page Range / eLocation ID: 794 to 798
Format(s): Medium: X
Sponsoring Org: National Science Foundation
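
As a concrete illustration of the frontend labeling scheme described in the abstract above, the sketch below bins a hypothetical utterance-level feature (mean F0) into terciles and assigns each utterance a low, middle, or high label. The feature values and the tercile split are assumptions for illustration; the paper's exact thresholds are not given here.

    import numpy as np

    def tercile_labels(values):
        """Bin utterance-level feature values into low/mid/high terciles."""
        lo, hi = np.percentile(values, [100 / 3, 200 / 3])
        return ["low" if v <= lo else "high" if v > hi else "mid" for v in values]

    # Hypothetical per-utterance mean-F0 values (Hz).
    mean_f0 = [118.0, 204.5, 151.2, 96.7, 187.9, 139.4]
    for f0, label in zip(mean_f0, tercile_labels(mean_f0)):
        # At synthesis time, one of the three settings is chosen and appended
        # to the standard contextual frontend features.
        print(f"mean_f0={f0:6.1f}  label={label}")

A small set of discrete settings is convenient here because contextual frontend features in HMM-based synthesis are clustered with decision trees, which operate naturally on categorical labels.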
More Like this
  1. ISCA (Ed.)
    In this paper, we explore automatic prediction of the dialect density of African American English (AAE), where dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect. We investigate several acoustic and language modeling features, including the commonly used X-vector representation and the ComParE feature set, in addition to prosodic information and information extracted from ASR transcripts of the audio files. To address issues of limited labeled data, we use a weakly supervised model to project prosodic and X-vector features into low-dimensional, task-relevant representations. An XGBoost model is then used to predict the speaker's dialect density from these features and to show which are most significant during inference. We evaluate the utility of these features, both alone and in combination, for the given task. This work, which does not rely on hand-labeled transcripts, is performed on audio segments from the CORAAL database. We show a significant correlation between our predicted and ground-truth dialect density measures for AAE speech in this database, and we propose this work as a tool for explaining and mitigating bias in speech technology.
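    The dialect density definition quoted in this abstract is straightforward to state in code. The sketch below is a minimal illustration; the set of flagged word indices is a hypothetical stand-in for the annotation or classification step, which the paper instead predicts from acoustic and language features.

        def dialect_density(words, marked_indices):
            """Percentage of words in an utterance with non-standard dialect features.

            `marked_indices` is a hypothetical set of word positions flagged as
            carrying AAE characteristics (by an annotator or a classifier).
            """
            return 100.0 * len(marked_indices) / len(words) if words else 0.0

        utterance = "he been working on that all day".split()
        print(dialect_density(utterance, {1}))  # "been" flagged: 1 of 7 words, ~14.3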
  2. We model coordination and coregulation patterns in 33 triads engaged in collaboratively solving a challenging computer programming task for approximately 20 minutes. Our goal is to prospectively model the speech rate (words/sec), an important signal of turn-taking and active participation, of one teammate (A, B, or C) from time-lagged nonverbal signals (speech rate and acoustic-prosodic features) of the other two (i.e., A + B → C; A + C → B; B + C → A) and from task-related context features. We trained feed-forward neural networks (FFNNs) and long short-term memory recurrent neural networks (LSTMs) using group-level nested cross-validation. LSTMs outperformed FFNNs and a chance baseline, and could predict speech rate up to 6 s into the future. A multimodal combination of speech rate, acoustic-prosodic, and task context features outperformed unimodal and bimodal signals. The extent to which the models could predict an individual's speech rate was positively related to that individual's scores on a subsequent posttest, suggesting a link between coordination/coregulation and collaborative learning outcomes. We discuss applications of the models for real-time systems that monitor the collaborative process and intervene to promote positive collaborative outcomes.
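    A minimal sketch of this prediction setup, written in PyTorch rather than as the authors' exact architecture: an LSTM consumes a window of time-lagged signals from two teammates and regresses the third teammate's future speech rate. The feature count, window length, and hidden size below are all assumptions.

        import torch
        import torch.nn as nn

        N_FEATURES = 8   # e.g., 2 teammates x (speech rate + 3 acoustic-prosodic features)
        WINDOW = 10      # number of time-lagged input steps

        class SpeechRateLSTM(nn.Module):
            def __init__(self, n_features=N_FEATURES, hidden=32):
                super().__init__()
                self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
                self.head = nn.Linear(hidden, 1)  # predicted words/sec

            def forward(self, x):                 # x: (batch, WINDOW, n_features)
                out, _ = self.lstm(x)
                return self.head(out[:, -1])      # regress from the final time step

        model = SpeechRateLSTM()
        windows = torch.randn(4, WINDOW, N_FEATURES)  # four synthetic input windows
        print(model(windows).shape)                   # torch.Size([4, 1])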
  3. Acoustic cues are characteristic patterns in the speech signal that provide lexical, prosodic, or additional information, such as speaker identity. In particular, acoustic cues related to linguistic distinctive features can be extracted and marked from the speech signal, and these cues can be used to infer the intended underlying phoneme sequence in an utterance. This study describes a framework for labeling acoustic cues in speech, including a suite of canonical cue prediction algorithms that facilitates manual labeling and provides a standard for analyzing variations in surface realizations. A brief examination of subsets of annotated speech data shows that labeling acoustic cues opens the possibility of detailed analyses of cue modification patterns in speech.
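    One way to picture the output of such a labeling framework is one record per detected cue, which can then be compared against the canonical cue predictions. The field names below are assumptions for illustration, not the paper's actual label format.

        from dataclasses import dataclass

        @dataclass
        class CueLabel:
            time: float   # position in the signal, in seconds
            cue: str      # the acoustic cue, e.g., a landmark tied to a distinctive feature
            value: str    # surface realization relative to the canonical prediction

        # Hypothetical annotations; comparing them against canonical cue
        # predictions exposes modification patterns in the surface realization.
        labels = [
            CueLabel(0.142, "stop-burst", "present"),
            CueLabel(0.388, "nasal-murmur", "weakened"),
        ]
        print(labels[0])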
  4. Building on previous work in subset selection of training data for text-to-speech (TTS), this work compares speaker-level and utterance-level selection of TTS training data, using acoustic features to guide selection. We find that speaker-based selection is more effective than utterance-based selection, regardless of whether selection is guided by a single feature or a combination of features. We use US English telephone data collected for automatic speech recognition to simulate the conditions of TTS training on low-resource languages. Our best voice achieves a human-evaluated WER of 29.0% on semantically unpredictable sentences, a significant improvement over our baseline voice trained on the same amount of randomly selected utterances, which performed at 42.4% WER. In addition to subjective voice evaluations with Amazon Mechanical Turk, we also explored objective voice evaluation using mel-cepstral distortion, and found that this measure correlates strongly with human evaluations of intelligibility, indicating that it may be a useful way to evaluate or pre-select voices in future work.
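    Mel-cepstral distortion, the objective measure mentioned in this abstract, has a widely used standard form. The sketch below computes a frame-averaged MCD in dB, assuming the reference and synthesized mel-cepstra are already time-aligned (alignment, e.g., by dynamic time warping, is outside this sketch) and that the 0th (energy) coefficient has been excluded.

        import numpy as np

        def mel_cepstral_distortion(ref, syn):
            """Frame-averaged mel-cepstral distortion (dB) between two
            (n_frames, n_coeffs) arrays of mel-cepstral coefficients."""
            diff = ref - syn
            frame_mcd = (10.0 / np.log(10.0)) * np.sqrt(2.0 * np.sum(diff**2, axis=1))
            return float(np.mean(frame_mcd))

        # Synthetic example: two random 100-frame, 24-coefficient sequences.
        rng = np.random.default_rng(0)
        ref, syn = rng.normal(size=(100, 24)), rng.normal(size=(100, 24))
        print(f"MCD = {mel_cepstral_distortion(ref, syn):.2f} dB")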