Skip to content

Detail of publication

Citation

Železný, M. and Krňoul, Z. : Czech audio-visual speech synthesis with an HMM-trained speech database and enhanced coarticulation . Digest of the Proceedings of the WSEAS Conferences, p. 4631741-4631746, WSEAS, Rethymno, 2003.

Abstract

The task of visual speech synthesis is usually solved by concatenation of basic speech units selected from a visual speech database. There are two main problems in this process. The first problem is a design of a database, that means estimation of the database parameters for all basic speech units. Second problem is a way how to concatenate selected basic phonetic units so as to eliminate the coarticulation effect. Both problems are aimed in our work, resulting in the Czech audio-visual speech synthesizer. We use HMM training process instead of some form of averaging for obtaining statistically best parameters for all basic phonetic units. For solution of a coarticulation effect we use the method of dominance functions.

Detail of publication

Title: Czech audio-visual speech synthesis with an HMM-trained speech database and enhanced coarticulation
Author: Železný, M. ; Krňoul, Z.
Language: English
Date of publication: 13 Oct 2003
Year: 2003
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Digest of the Proceedings of the WSEAS Conferences
Page: 4631741 - 4631746
ISBN: 960-8052-90-4
Publisher: WSEAS
Address: Rethymno
Date: 13 Oct 2003 - 15 Oct 2003
/ /

Keywords

audio-visual speech database, audio-visual speech synthesis, talking head, coarticulation, hidden Markov models

BibTeX

@INPROCEEDINGS{ZeleznyM_2003_Czechaudio-visual_2,
 author = {\v{Z}elezn\'{y}, M. and Kr\v{n}oul, Z.},
 title = {Czech audio-visual speech synthesis with an HMM-trained speech database and enhanced coarticulation},
 year = {2003},
 publisher = {WSEAS},
 journal = {Digest of the Proceedings of the WSEAS Conferences},
 address = {Rethymno},
 pages = {4631741-4631746},
 ISBN = {960-8052-90-4},
 url = {http://www.kky.zcu.cz/en/publications/ZeleznyM_2003_Czechaudio-visual_2},
}