Publications
Detail of publication
Citation
p. 365-373, Springer, 2014. : GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker’s Voice . Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings, Lecture Notes in Artificial Intelligence, vol. 8655,
Additional information
Abstract
This paper describes two experiments. The first one deals with evaluation of synthetic speech quality by reverse identification of original speakers whose voices had been used for several Czech text-to-speech (TTS) systems. The second experiment was aimed at evaluation of the influence of voice transformation on the original speaker recognition. The paper further describes an analysis of the influence of initial settings for creation and training of the Gaussian mixture models (GMM), and the influence of different types of used speech features (spectral and/or supra-segmental) on correctness of GMM identification. The stability of the identification process with respect to the duration of the tested sentence (number of the processed frames) was analysed, too.
Detail of publication
Title: | GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker’s Voice |
---|---|
Author: | Přibil, J ; Přibilová, A ; Matoušek, J. |
Language: | Czech |
Date of publication: | 8 Sep 2014 |
Year: | 2014 |
Type of publication: | Papers in proceedings of reviewed conferences |
Book title: | Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings |
Series: | Lecture Notes in Artificial Intelligence |
Číslo vydání: | 8655 |
Page: | 365 - 373 |
DOI: | 10.1007/978-3-319-10816-2_44 |
ISBN: | 978-3-319-10815-5 |
ISSN: | 0302-9743 |
Publisher: | Springer |
Date: | 8 Sep 2014 - 12 Sep 2014 |
Keywords
quality of synthetic speech, text-to-speech system, GMM classification, statistical analysis
BibTeX
@INCOLLECTION{PribilJ_2014_GMMClassificationof, author = {P\v{r}ibil, J and P\v{r}ibilov\'{a}, A and Matou\v{s}ek, J.}, title = {GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker's Voice}, year = {2014}, publisher = {Springer}, volume = {8655}, pages = {365-373}, booktitle = {Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings}, series = {Lecture Notes in Artificial Intelligence}, ISBN = {978-3-319-10815-5}, ISSN = {0302-9743}, doi = {10.1007/978-3-319-10816-2_44}, url = {http://www.kky.zcu.cz/en/publications/PribilJ_2014_GMMClassificationof}, }