
Publication details

Citation

Tihelka, D., Matoušek, J., and Hanzlíček, Z.: Modelling F0 Dynamics in Unit Selection Based Speech Synthesis. Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings, Lecture Notes in Artificial Intelligence, vol. 8655, pp. 457-464, 2014.

Further information

SpringerLink

Abstract

In common unit selection implementations, F0 continuity is measured as one of the concatenation cost features, with the expectation that a smooth transition between units (with regard to speech melody) is ensured when the F0 difference is low enough. This measure generally uses a static F0 value computed at the unit boundary. In the present paper we show, however, that static F0 values are not sufficient for smooth concatenation of speech units, and that the dynamic nature of the F0 contour must be taken into account. Two schemes for handling F0 dynamics are presented, and speech generated by both schemes is compared by means of listening tests on specially selected phrases that are known to carry unnatural artefacts. Advantages and disadvantages of the individual schemes are also discussed.
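The distinction the abstract draws between a static and a dynamic F0 measure can be illustrated with a minimal sketch. This is not either of the two schemes from the paper, which the abstract does not describe; the function names, the `slope_weight` parameter, and the sample contours are all hypothetical. It only shows how two candidate joins can look identical to a static boundary measure while differing once the local F0 slope is considered.

```python
def static_f0_cost(left_f0, right_f0):
    """Absolute F0 difference (Hz) at the join: last left frame vs. first right frame."""
    return abs(left_f0[-1] - right_f0[0])


def boundary_slope(f0, at_end):
    """F0 slope (Hz per frame) from the two frames nearest the boundary (hypothetical helper)."""
    return f0[-1] - f0[-2] if at_end else f0[1] - f0[0]


def dynamic_f0_cost(left_f0, right_f0, slope_weight=1.0):
    """Static difference plus a weighted penalty for mismatched slopes at the join.

    The additive form and the weight are assumptions made for illustration only.
    """
    slope_diff = abs(boundary_slope(left_f0, True) - boundary_slope(right_f0, False))
    return static_f0_cost(left_f0, right_f0) + slope_weight * slope_diff


# Made-up F0 contours in Hz, one value per frame.
rising = [100.0, 105.0, 110.0]        # left unit, F0 rising toward the join
keeps_rising = [110.0, 115.0, 120.0]  # right candidate that continues the rise
falls = [110.0, 105.0, 100.0]         # right candidate that reverses direction

for name, candidate in (("keeps_rising", keeps_rising), ("falls", falls)):
    print(name, static_f0_cost(rising, candidate), dynamic_f0_cost(rising, candidate))
```

Both candidates start at exactly the F0 value where the left unit ends, so the static cost cannot tell them apart; only the slope term separates the join that continues the melody from the one that reverses it.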

Publication details

Title: Modelling F0 Dynamics in Unit Selection Based Speech Synthesis
Authors: Tihelka, D.; Matoušek, J.; Hanzlíček, Z.
Title (Czech): Modelování dynamiky F0 v syntéze řeči výběrem jednotek
Publication language: Czech
Publication date: 8 September 2014
Publication year: 2014
Publication type: Paper in conference proceedings
Book title: Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings
Series: Lecture Notes in Artificial Intelligence
Volume: 8655
Pages: 457-464
DOI: 10.1007/978-3-319-10816-2_55
ISBN: 978-3-319-10815-5
ISSN: 0302-9743
Date: 8-12 September 2014

Keywords

text-to-speech synthesis, unit selection, concatenation cost, fundamental frequency F0

BibTeX

@INCOLLECTION{TihelkaD_2014_ModellingF0Dynamics,
 author = {Tihelka, D. and Matou\v{s}ek, J. and Hanzl\'{i}\v{c}ek, Z.},
 title = {Modelling F0 Dynamics in Unit Selection Based Speech Synthesis},
 year = {2014},
 volume = {8655},
 pages = {457--464},
 booktitle = {Text, Speech, and Dialogue, 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings},
 series = {Lecture Notes in Artificial Intelligence},
 ISBN = {978-3-319-10815-5},
 ISSN = {0302-9743},
 doi = {10.1007/978-3-319-10816-2_55},
 url = {http://www.kky.zcu.cz/en/publications/TihelkaD_2014_ModellingF0Dynamics},
}