Title: Adaptation and Frontend Features to Improve Naturalness in Found-Data Synthesis
We compare two approaches for training statistical parametric voices that make use of acoustic and prosodic features at the utterance level, with the aim of improving the naturalness of the resulting voices: subset adaptation, and adding new acoustic and prosodic features at the frontend. We have found that the approach of labeling high, middle, or low values for a given feature at the frontend, and then choosing which setting to use at synthesis time, can produce voices rated as significantly more natural than a baseline voice that uses only the standard contextual frontend features, for both HMM-based and neural network-based synthesis.
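The high/middle/low labeling scheme described above can be sketched in a few lines. Below is a minimal illustration, assuming an utterance-level feature such as mean F0 and a simple tertile split over the training corpus; the feature choice, thresholds, and names are illustrative assumptions, not the paper's exact recipe.

```python
# Minimal sketch of the high/mid/low frontend-labeling idea from the abstract.
# The feature (mean F0) and the tertile split are illustrative assumptions.
import numpy as np

def tertile_labels(values):
    """Map each utterance-level feature value to 'low', 'mid', or 'high'."""
    lo, hi = np.quantile(values, [1/3, 2/3])  # tertile boundaries over the corpus
    return ["low" if v <= lo else "high" if v > hi else "mid" for v in values]

# Hypothetical per-utterance feature values (e.g., mean F0 in Hz).
mean_f0 = np.array([110.0, 180.5, 145.2, 201.3, 132.8, 167.0])
labels = tertile_labels(mean_f0)

# Each label would be appended to the standard contextual frontend features
# for its utterance; at synthesis time the desired setting is chosen directly.
for f0, lab in zip(mean_f0, labels):
    print(f"mean_f0={f0:6.1f} Hz -> frontend label: {lab}")
```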
Award ID(s): 1717680
PAR ID: 10058671
Author(s) / Creator(s):
Date Published:
Journal Name: Proceedings of Speech Prosody 2018
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Neuromorphic vision sensors (NVS), also known as silicon retina, capture aspects of the biological functionality of the mammalian retina by transducing incident photocurrent into an asynchronous stream of spikes that denote positive and negative changes in intensity. Current state-of-the-art devices are effectively leveraged in a variety of settings, but still suffer from distinct disadvantages as they are transitioned into high-performance environments, such as space and autonomy. This paper provides an outline and demonstration of a data synthesis tool that gleans characteristics from the retina and allows the user to not only convert traditional video into neuromorphic data, but also characterize design tradeoffs and inform future endeavors. Our retinomorphic model, RetinoSim, incorporates aspects of current NVS to allow for accurate data conversion while providing biologically inspired features to improve upon this baseline. RetinoSim was implemented in MATLAB with a Graphical User Interface frontend to allow for expeditious video conversion and architecture exploration. We demonstrate that the tool can be used for real-time conversion of sparse event streams, exploration of frontend configurations, and duplication of existing event datasets.
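As a rough illustration of the kind of frame-to-event conversion such a tool performs, the sketch below emits positive and negative events wherever per-pixel log intensity changes by more than a threshold, in the style of a DVS-like sensor. RetinoSim itself is a MATLAB tool with a richer retina-inspired pipeline; the threshold value, the helper name frames_to_events, and the synthetic input are assumptions made for illustration.

```python
# Minimal DVS-style frame-to-event conversion: emit (+/-) events where
# log intensity changes exceed a threshold. Threshold and inputs are assumed.
import numpy as np

def frames_to_events(frames, threshold=0.2, eps=1e-6):
    """Yield (t, y, x, polarity) events from a stack of grayscale frames."""
    ref = np.log(frames[0] + eps)          # per-pixel log-intensity reference
    for t, frame in enumerate(frames[1:], start=1):
        cur = np.log(frame + eps)
        diff = cur - ref
        for polarity, mask in ((+1, diff >= threshold), (-1, diff <= -threshold)):
            ys, xs = np.nonzero(mask)
            for y, x in zip(ys, xs):
                yield (t, y, x, polarity)
            ref[mask] = cur[mask]          # reset reference where events fired

# Synthetic example: a bright square moving across a dark background.
frames = np.zeros((5, 32, 32), dtype=np.float32)
for t in range(5):
    frames[t, 10:20, 5 + 4 * t:15 + 4 * t] = 1.0
events = list(frames_to_events(frames))
print(f"{len(events)} events from {len(frames)} frames")
```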
  2. Voice synthesis uses a voice model to synthesize arbitrary phrases. Advances in voice synthesis have made it possible to create an accurate voice model of a targeted individual, which can in turn be used to generate spoofed audio in his or her voice. Generating an accurate model of a target's voice requires the availability of a corpus of the target's speech. This paper makes the observation that the increasing popularity of voice interfaces that use cloud-backed speech recognition (e.g., Siri, Google Assistant, Amazon Alexa) increases the public's vulnerability to voice synthesis attacks. That is, our growing dependence on voice interfaces fosters the collection of our voices. As our main contribution, we show that voice recognition and voice accumulation (that is, the accumulation of users' voices) are separable. This paper introduces techniques for locally sanitizing voice inputs before they are transmitted to the cloud for processing. In essence, such methods employ audio processing techniques to remove distinctive voice characteristics, leaving only the information that is necessary for the cloud-based services to perform speech recognition. Our preliminary experiments show that our defenses prevent state-of-the-art voice synthesis techniques from constructing convincing forgeries of a user's speech, while still permitting accurate voice recognition.
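As a rough sketch of local sanitization before cloud processing, the example below perturbs speaker-identifying pitch characteristics with a simple pitch shift while leaving the signal intelligible enough for recognition. This is only one illustrative transformation, not necessarily the paper's actual technique; the input file name query.wav and the shift amount are assumptions.

```python
# Illustrative local voice sanitization before uploading audio to a cloud
# recognizer. A pitch shift is one simple way to perturb speaker-identifying
# characteristics; the paper's defenses may use different transformations.
import librosa
import soundfile as sf

y, sr = librosa.load("query.wav", sr=16000)          # hypothetical input file

# Shift the pitch by a few semitones: intelligibility (and thus recognition)
# is largely preserved, while speaker-specific pitch characteristics change.
y_sanitized = librosa.effects.pitch_shift(y, sr=sr, n_steps=3.0)

sf.write("query_sanitized.wav", y_sanitized, sr)     # transmit this instead
```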
  3. Lengthening and creaky voice are associated with prosodic finality in English. Listeners can use lengthening to identify both utterance-internal and final prosodic phrase boundaries and can use creak to locate utterance endings. Less is known about listeners' use of creak to locate internal prosodic boundaries and the relative importance assigned to duration and creak when both are present. Participants in two experiments segmented structurally ambiguous sentences in which duration and creak were manipulated to signal prosodic boundaries. When duration- and creak-based cues provided redundant information, their effects were additive. When these cues conflicted, the effect of creak was subtractive.
  4. In this paper, we explore automatic prediction of dialect density for the African American English (AAE) dialect, where dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect. We investigate several acoustic and language-modeling features, including the commonly used X-vector representation and the ComParE feature set, in addition to information extracted from ASR transcripts of the audio files and prosodic information. To address issues of limited labeled data, we use a weakly supervised model to project prosodic and X-vector features into low-dimensional, task-relevant representations. An XGBoost model is then used to predict the speaker's dialect density from these features and to show which of them are most significant during inference. We evaluate the utility of these features both alone and in combination for the given task. This work, which does not rely on hand-labeled transcripts, is performed on audio segments from the CORAAL database. We show a significant correlation between our predicted and ground-truth dialect density measures for AAE speech in this database and propose this work as a tool for explaining and mitigating bias in speech technology.
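A minimal sketch of the final prediction stage described above, assuming per-segment feature vectors (standing in for the projected prosodic and X-vector representations) and continuous dialect-density labels; the random data, dimensionality, and hyperparameters are placeholders, not the paper's configuration.

```python
# Sketch of an XGBoost regressor mapping per-segment feature vectors to a
# dialect-density score in [0, 1]. Random features stand in for the paper's
# actual low-dimensional representations.
import numpy as np
from xgboost import XGBRegressor
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 64))                 # stand-in projected features
y = rng.uniform(0.0, 1.0, size=500)            # stand-in dialect-density labels

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
model = XGBRegressor(n_estimators=200, max_depth=4, learning_rate=0.1)
model.fit(X_tr, y_tr)

pred = model.predict(X_te)
# Feature importances indicate which inputs drive the prediction, mirroring
# the paper's analysis of which features matter most during inference.
print("top-5 important feature indices:",
      np.argsort(model.feature_importances_)[::-1][:5])
```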