Skip to content

Detail of publication

Citation

Tihelka, D and Matoušek, J and Hanzlíček, Z. : Modelling F0 Dynamics in Unit Selection Based Speech Synthesis . Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings, Lecture Notes in Artificial Intelligence, vol. 8655, p. 457-464, 2014.

Additional information


Springerlink

Abstract

In the common unit selection implementations, F0 continuity is measured as one of concatenation cost features with the expectation that smooth units transition (regarding speech melody) is ensured when the difference of F0 is low enough. This measure generally uses a static F0 value computed at the units boundary. In the present paper we show, however, that the use of static F0 values is not enough for smooth speech units concatenation, and that a dynamic nature of the F0 contour must be taken into account. Two schemes of dynamic F0 handling are presented, and speech generated by both schemes is compared by means of listening tests on specially selected phrases which are known to carry unnatural artefacts. Advantages and disadvantages of the individual schemes are also discussed.

Detail of publication

Title: Modelling F0 Dynamics in Unit Selection Based Speech Synthesis
Author: Tihelka, D ; Matoušek, J ; Hanzlíček, Z.
Language: Czech
Date of publication: 8 Sep 2014
Year: 2014
Type of publication: Papers in proceedings of reviewed conferences
Book title: Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings
Series: Lecture Notes in Artificial Intelligence
Číslo vydání: 8655
Page: 457 - 464
DOI: 10.1007/978-3-319-10816-2_55
ISBN: 978-3-319-10815-5
ISSN: 0302-9743
Date: 8 Sep 2014 - 12 Sep 2014
/ 2016-01-13 17:10:56 /

Keywords

text-to-speech synthesis, unit selection, concatenation cost, fundamental frequency F0

BibTeX

@INCOLLECTION{TihelkaD_2014_ModellingF0Dynamics,
 author = {Tihelka, D and Matou\v{s}ek, J and Hanzl\'{i}\v{c}ek, Z.},
 title = {Modelling F0 Dynamics in Unit Selection Based Speech Synthesis},
 year = {2014},
 volume = {8655},
 pages = {457-464},
 booktitle = {Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings},
 series = {Lecture Notes in Artificial Intelligence},
 ISBN = {978-3-319-10815-5},
 ISSN = {0302-9743},
 doi = {10.1007/978-3-319-10816-2_55},
 url = {http://www.kky.zcu.cz/en/publications/TihelkaD_2014_ModellingF0Dynamics},
}