Detail of publication

Citation

Matoušek, J., Tihelka, D., and Hanzlíček, Z.: Reducing Footprint of Unit Selection TTS System by Excluding Utterances from Source Speech Corpus. 19th Czech-German Workshop on Speech Processing, pp. 92-98, Prague, 2009.

Abstract

Current unit selection speech synthesis systems can produce high-quality speech, but at the expense of enormous computational and storage requirements. In this paper, an existing large speech corpus employed for unit-selection-based synthesis of Czech speech is analysed. Subsequently, a procedure for excluding a number of utterances from the source speech corpus is proposed. The procedure is based on statistics of the utilisation of all utterances during text-to-speech synthesis of a large amount of text. The exclusion of whole utterances was preferred over the exclusion of particular instances of speech units in order to preserve the main feature of the unit selection framework: selecting the longest possible sequence of contiguous speech units. After the exclusion, the footprint of the system was reduced by approximately 42%. The resulting synthetic speech was then judged by means of 5-point-scale CCR (comparison category rating) listening tests and was rated on average as only "slightly worse" than speech generated by the baseline (i.e. unreduced) system.
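
The exclusion idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not state the exact ranking criterion, so the sketch assumes that utterances are ranked by a raw usage count and that the least-used ones are dropped until a target share of the corpus size is removed. The names `count_utterance_usage`, `choose_exclusions`, `selection_log`, `sizes`, and `target_fraction` are all hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Set


def count_utterance_usage(selection_log: Iterable[str]) -> Counter:
    """Count how many selected units came from each source utterance.

    `selection_log` is a hypothetical log with one source-utterance ID
    per unit chosen while synthesising a large set of texts.
    """
    return Counter(selection_log)


def choose_exclusions(
    usage: Counter, sizes: Dict[str, int], target_fraction: float
) -> Set[str]:
    """Pick whole utterances to exclude, least used first.

    Assumption: ranking is by raw usage count, and exclusion stops
    before the removed size would exceed `target_fraction` of the
    total corpus size. The paper's actual criterion may differ.
    """
    budget = target_fraction * sum(sizes.values())
    # Least-used utterances first; ties broken by ID for determinism.
    ranked = sorted(sizes, key=lambda utt: (usage.get(utt, 0), utt))
    excluded: Set[str] = set()
    removed = 0
    for utt in ranked:
        if removed + sizes[utt] > budget:
            break
        excluded.add(utt)
        removed += sizes[utt]
    return excluded
```

Excluding whole utterances rather than individual unit instances keeps the remaining utterances intact, so long contiguous unit sequences can still be selected from them. In practice, `target_fraction` would be tuned against listening-test results rather than fixed in advance.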

Detail of publication

Title: Reducing Footprint of Unit Selection TTS System by Excluding Utterances from Source Speech Corpus
Author: Matoušek, J. ; Tihelka, D. ; Hanzlíček, Z.
Language: English
Date of publication: 1 Oct 2009
Year: 2009
Type of publication: Papers in proceedings of reviewed conferences
Book title: 19th Czech-German Workshop on Speech Processing
Page: 92 - 98
ISBN: 978-80-86269-18-4
Address: Prague
Date: 29 Sep 2009 - 1 Oct 2009

BibTeX

@INPROCEEDINGS{MatousekJ_2009_ReducingFootprintof,
 author = {Matou\v{s}ek, J. and Tihelka, D. and Hanzl\'{i}\v{c}ek, Z.},
 title = {Reducing Footprint of Unit Selection TTS System by Excluding Utterances from Source Speech Corpus},
 year = {2009},
 address = {Prague},
 pages = {92--98},
 booktitle = {19th Czech-German Workshop on Speech Processing},
 ISBN = {978-80-86269-18-4},
 url = {http://www.kky.zcu.cz/en/publications/MatousekJ_2009_ReducingFootprintof},
}