Skip to content

Detail of publication

Citation

Matoušek, J. : Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis . Text, Speech and Dialogue, proceedings of the 13th International Conference TSD 2010, Lecture Notes in Artificial Intelligence, p. 369-376, Springer, Berlin-Heidelberg, Germany, 2010.

Additional information


Springerlink

Abstract

In this paper, automatic segmentation of parasitic speech sounds in speech corpora for text-to-speech (TTS) synthesis is presented. The automatic segmentation is, beside the automatic detection of the presence of such sounds in speech corpora, an important step in the precise localisation of parasitic sounds in speech corpora. The main goal of this study is to find out whether the segmentation of these sounds is accurate enough to enable cutting the sounds out of synthetic speech or explicit modelling of these sounds during synthesis. HMM-based classifier was employed to detect the parasitic sounds and to find the boundaries between these sounds and the surrounding phones simultaneously. The results show that the automatic segmentation of parasitic sounds is comparable to the segmentation of other phones, which indicates that the cutting out or the explicit usage of parasitic sounds should be possible.

Detail of publication

Title: Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis
Author: Matoušek, J.
Language: English
Date of publication: 6 Sep 2010
Year: 2010
Type of publication: Papers in journals
Book title: Text, Speech and Dialogue, proceedings of the 13th International Conference TSD 2010
Series: Lecture Notes in Artificial Intelligence
Page: 369 - 376
ISBN: 3-642-15759-9
ISSN: 0302-9743
Publisher: Springer
Address: Berlin-Heidelberg, Germany
Date: 6 Sep 2010 - 10 Sep 2010
/ 2011-03-15 17:43:36 /

Keywords

parasitic speech sound, speech synthesis, unit selection, HMM, automatic phonetic segmentation

BibTeX

@INPROCEEDINGS{MatousekJ_2010_Automatic,
 author = {Matou\v{s}ek, J.},
 title = {Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis},
 year = {2010},
 publisher = {Springer},
 address = {Berlin-Heidelberg, Germany},
 pages = {369-376},
 booktitle = {Text, Speech and Dialogue, proceedings of the 13th International Conference TSD 2010},
 series = {Lecture Notes in Artificial Intelligence},
 ISBN = {3-642-15759-9},
 ISSN = {0302-9743},
 url = {http://www.kky.zcu.cz/en/publications/MatousekJ_2010_Automatic},
}