Publications
Detail of publication
Citation
p. 457-464, 2014. : Modelling F0 Dynamics in Unit Selection Based Speech Synthesis . Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings, Lecture Notes in Artificial Intelligence, vol. 8655,
Additional information
Abstract
In the common unit selection implementations, F0 continuity is measured as one of concatenation cost features with the expectation that smooth units transition (regarding speech melody) is ensured when the difference of F0 is low enough. This measure generally uses a static F0 value computed at the units boundary. In the present paper we show, however, that the use of static F0 values is not enough for smooth speech units concatenation, and that a dynamic nature of the F0 contour must be taken into account. Two schemes of dynamic F0 handling are presented, and speech generated by both schemes is compared by means of listening tests on specially selected phrases which are known to carry unnatural artefacts. Advantages and disadvantages of the individual schemes are also discussed.
Detail of publication
Title: | Modelling F0 Dynamics in Unit Selection Based Speech Synthesis |
---|---|
Author: | Tihelka, D ; Matoušek, J ; Hanzlíček, Z. |
Language: | Czech |
Date of publication: | 8 Sep 2014 |
Year: | 2014 |
Type of publication: | Papers in proceedings of reviewed conferences |
Book title: | Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings |
Series: | Lecture Notes in Artificial Intelligence |
Číslo vydání: | 8655 |
Page: | 457 - 464 |
DOI: | 10.1007/978-3-319-10816-2_55 |
ISBN: | 978-3-319-10815-5 |
ISSN: | 0302-9743 |
Date: | 8 Sep 2014 - 12 Sep 2014 |
Keywords
text-to-speech synthesis, unit selection, concatenation cost, fundamental frequency F0
BibTeX
@INCOLLECTION{TihelkaD_2014_ModellingF0Dynamics, author = {Tihelka, D and Matou\v{s}ek, J and Hanzl\'{i}\v{c}ek, Z.}, title = {Modelling F0 Dynamics in Unit Selection Based Speech Synthesis}, year = {2014}, volume = {8655}, pages = {457-464}, booktitle = {Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings}, series = {Lecture Notes in Artificial Intelligence}, ISBN = {978-3-319-10815-5}, ISSN = {0302-9743}, doi = {10.1007/978-3-319-10816-2_55}, url = {http://www.kky.zcu.cz/en/publications/TihelkaD_2014_ModellingF0Dynamics}, }