Detail of publication

Citation

Přibil, J., Přibilová, A., and Matoušek, J.: GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker’s Voice. Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings, Lecture Notes in Artificial Intelligence, vol. 8655, pp. 365-373, Springer, 2014.

Abstract

This paper describes two experiments. The first evaluates synthetic speech quality by reverse identification of the original speakers whose voices were used for several Czech text-to-speech (TTS) systems. The second evaluates the influence of voice transformation on recognition of the original speaker. The paper further analyses the influence of the initial settings for creating and training the Gaussian mixture models (GMM), and of the type of speech features used (spectral and/or supra-segmental), on the correctness of GMM identification. The stability of the identification process with respect to the duration of the tested sentence (the number of processed frames) was also analysed.

Detail of publication

Title: GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker’s Voice
Author: Přibil, J ; Přibilová, A ; Matoušek, J.
Language: Czech
Date of publication: 8 Sep 2014
Year: 2014
Type of publication: Papers in proceedings of reviewed conferences
Book title: Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings
Series: Lecture Notes in Artificial Intelligence
Volume: 8655
Pages: 365-373
DOI: 10.1007/978-3-319-10816-2_44
ISBN: 978-3-319-10815-5
ISSN: 0302-9743
Publisher: Springer
Date: 8 Sep 2014 - 12 Sep 2014

Keywords

quality of synthetic speech, text-to-speech system, GMM classification, statistical analysis

BibTeX

@INCOLLECTION{PribilJ_2014_GMMClassificationof,
 author = {P\v{r}ibil, J and P\v{r}ibilov\'{a}, A and Matou\v{s}ek, J.},
 title = {GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker's Voice},
 year = {2014},
 publisher = {Springer},
 volume = {8655},
 pages = {365-373},
 booktitle = {Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings},
 series = {Lecture Notes in Artificial Intelligence},
 ISBN = {978-3-319-10815-5},
 ISSN = {0302-9743},
 doi = {10.1007/978-3-319-10816-2_44},
 url = {http://www.kky.zcu.cz/en/publications/PribilJ_2014_GMMClassificationof},
}