Publications
Detail of publication
Citation
p. 2911-2914, Causal Productions, 2009. : Audio-Visual Speech Asynchrony Modeling in a Talking Head . Proceedings of Interspeech 2009, 10, vol. 1,
Download PDF
Abstract
An audio-visual speech synthesis system with modeling of asynchrony between auditory and visual speech modalities is proposed in the paper. Corpus-based study of real recordings gave us the required data for understanding the problem of modalities asynchrony that is partially caused by the coarticulationphenomena. A set of context-dependent timing rules and recommendations was elaborated in order to make a synchronization of auditory and visual speech cues of the animated talking head similar to a natural humanlike way. The cognitive evaluation of the model-based talking head for Russian with implementation of the original asynchrony model has shown high intelligibility and naturalness of audio-visual synthesized speech.
Detail of publication
Title: | Audio-Visual Speech Asynchrony Modeling in a Talking Head |
---|---|
Author: | Alexey Karpov ; Liliya Tsirulnik ; Zdeněk Krňoul ; Andrey Ronzhin ; Boris Lobanov ; Miloš Železný |
Language: | English |
Date of publication: | 10 Sep 2009 |
Year: | 2009 |
Type of publication: | Papers in journals |
Book title: | Proceedings of Interspeech 2009 |
Series: | 10 |
Číslo vydání: | 1 |
Page: | 2911 - 2914 |
ISSN: | 1990-9772 |
Publisher: | Causal Productions |
Keywords
audio-visual speech processing, text-to-speech synthesis, multimodal speech perception, cognitive study
BibTeX
@ARTICLE{AlexeyKarpov_2009_Audio-VisualSpeech, author = {Alexey Karpov and Liliya Tsirulnik and Zden\v{e}k Kr\v{n}oul and Andrey Ronzhin and Boris Lobanov and Milo\v{s} \v{Z}elezn\'{y}}, title = {Audio-Visual Speech Asynchrony Modeling in a Talking Head}, year = {2009}, publisher = {Causal Productions}, volume = {1}, pages = {2911-2914}, booktitle = {Proceedings of Interspeech 2009}, series = {10}, ISSN = {1990-9772}, url = {http://www.kky.zcu.cz/en/publications/AlexeyKarpov_2009_Audio-VisualSpeech}, }