Publications
Detail of publication
Citation
: Audio-Visual Speech Asynchrony Modeling in a Talking Head . Proceedings of Interspeech 2009, 10, vol. 1, p. 2911-2914, Causal Productions, 2009.
Download PDF
Abstract
An audio-visual speech synthesis system with modeling of asynchrony between auditory and visual speech modalities is proposed in the paper. Corpus-based study of real recordings gave us the required data for understanding the problem of modalities asynchrony that is partially caused by the coarticulationphenomena. A set of context-dependent timing rules and recommendations was elaborated in order to make a synchronization of auditory and visual speech cues of the animated talking head similar to a natural humanlike way. The cognitive evaluation of the model-based talking head for Russian with implementation of the original asynchrony model has shown high intelligibility and naturalness of audio-visual synthesized speech.
Detail of publication
| Title: | Audio-Visual Speech Asynchrony Modeling in a Talking Head |
|---|---|
| Author: | Alexey Karpov ; Liliya Tsirulnik ; Zdeněk Krňoul ; Andrey Ronzhin ; Boris Lobanov ; Miloš Železný |
| Language: | English |
| Date of publication: | 10 Sep 2009 |
| Year: | 2009 |
| Type of publication: | Papers in journals |
| Book title: | Proceedings of Interspeech 2009 |
| Series: | 10 |
| Číslo vydání: | 1 |
| Page: | 2911 - 2914 |
| ISSN: | 1990-9772 |
| Publisher: | Causal Productions |
Keywords
audio-visual speech processing, text-to-speech synthesis, multimodal speech perception, cognitive study
BibTeX
@ARTICLE{AlexeyKarpov_2009_Audio-VisualSpeech,
author = {Alexey Karpov and Liliya Tsirulnik and Zden\v{e}k Kr\v{n}oul and Andrey Ronzhin and Boris Lobanov and Milo\v{s} \v{Z}elezn\'{y}},
title = {Audio-Visual Speech Asynchrony Modeling in a Talking Head},
year = {2009},
publisher = {Causal Productions},
volume = {1},
pages = {2911-2914},
booktitle = {Proceedings of Interspeech 2009},
series = {10},
ISSN = {1990-9772},
url = {http://www.kky.zcu.cz/en/publications/AlexeyKarpov_2009_Audio-VisualSpeech},
}


ZČU
