Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis

Tesfaye Biru, Elshadai; Tofik Mohammed, Yishak; Tofu, David; Cooper, Erica; Hirschberg, Julia

doi:10.21437/SSW.2019-37

Citation Details

Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis

While large TTS corpora exist for commercial sys- tems created for high-resource languages such as Man- darin, English, and Spanish, for many languages such as Amharic, which are spoken by millions of people, this is not the case. We are working with “found” data collected for other purposes (e.g. training ASR systems) or avail- able on the web (e.g. news broadcasts, audiobooks) to produce TTS systems for low-resource languages which do not currently have expensive, commercial systems. This study describes TTS systems built for Amharic from “found” data and includes systems built from di erent acoustic-prosodic subsets of the data, systems built from combined high and lower quality data using adaptation, and systems which use prediction of Amharic gemination to improve naturalness as perceived by evaluators. more »

Award ID(s):: 1717680

PAR ID:: 10174744

Author(s) / Creator(s):: Tesfaye Biru, Elshadai; Tofik Mohammed, Yishak; Tofu, David; Cooper, Erica; Hirschberg, Julia

Date Published:: 2019-09-20

Journal Name:: 10th ISCA Speech Synthesis Workshop

Page Range / eLocation ID:: 205 to 210

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.21437/SSW.2019-37

More Like this