Skip to content

Detail of publication

Citation

Grůber, M. and Matoušek, J. and Tihelka, D. and Hanzlíček, Z. : Reducing footprint of unit selection TTS system by removing linguistic segments with rarely selected units . 2014 IEEE 12th International Conference on Signal Processing Proceedings , vol. 1, p. 494-499, Institute of Electrical and Electronics Engineers, Inc., Beijing, China, 2014.

Abstract

This paper is focused on reducing the size of speech corpora that are used in the unit-selection-based TTS systems. The size of a speech corpus influences the system requirements like storage and memory demands and computational complexity. For high quality speech synthesis, the speech corpus usually consists of several thousands of sentences. Thus an appropriate reduction of the corpus size is likely to lead to a decrease in the system requirements. In this work, a comparison of impacts on synthetic speech quality is presented when removing specific instances of different linguistic segment types from the original corpus. Removal of the following segment types is used and compared with each other: whole sentences, phrases, words, and diphones. Only segments with rarely selected units are removed from the corpus so that the resulting footprint size reaches a predefined value. Results confirm that synthetic speech generated by the TTS systems using the reduced corpora is of a slightly worse quality when compared with speech produced by the system employing the original full corpus. The comparison of the reduction based on different linguistic segments is also presented here.

Detail of publication

Title: Reducing footprint of unit selection TTS system by removing linguistic segments with rarely selected units
Author: Grůber, M. ; Matoušek, J. ; Tihelka, D. ; Hanzlíček, Z.
Language: English
Date of publication: 19 Oct 2014
Year: 2014
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: 2014 IEEE 12th International Conference on Signal Processing Proceedings
Číslo vydání: 1
Page: 494 - 499
DOI: 10.1109/ICOSP.2014.7015054
ISBN: 978-1-4799-2188-1
ISSN: 2164-5221
Publisher: Institute of Electrical and Electronics Engineers, Inc.
Address: Beijing, China
Date: 19 Oct 2014 - 23 Oct 2014
/ 2016-01-13 16:46:01 /

Keywords

speech synthesis, TTS, unit selection, reducing footprint

BibTeX

@INPROCEEDINGS{GruberM_2014_Reducingfootprintof,
 author = {Gr\r{u}ber, M. and Matou\v{s}ek, J. and Tihelka, D. and Hanzl\'{i}\v{c}ek, Z.},
 title = {Reducing footprint of unit selection TTS system by removing linguistic segments with rarely selected units },
 year = {2014},
 publisher = {Institute of Electrical and Electronics Engineers, Inc.},
 journal = {2014 IEEE 12th International Conference on Signal Processing Proceedings },
 address = {Beijing, China},
 volume = {1},
 pages = {494-499},
 ISBN = {978-1-4799-2188-1},
 ISSN = {2164-5221},
 doi = {10.1109/ICOSP.2014.7015054},
 url = {http://www.kky.zcu.cz/en/publications/GruberM_2014_Reducingfootprintof},
}