Detail of publication

Citation

Císař, P. and Zelinka, J. and Železný, M. and Karpov, A. and Ronzhin, A.: Audio-visual speech recognition for Slavonic languages (Czech and Russian). Proceedings of the 11th international conference "Speech and computer" SPECOM'2006, p. 493-498, Anatolya Publishers, St. Petersburg, 2006.

Abstract

The paper presents the results of recent experiments with audio-visual speech recognition for two popular Slavonic languages: Russian and Czech. It describes the applied test tasks, the collection of the multimodal databases and the data pre-processing, the methods for visual feature extraction (geometric shape-based features; DCT and PCA pixel-based visual parameterization), and the models of audio-visual recognition (concatenation of feature vectors and multi-stream models). The prototypes of applied systems that will use the audio-visual speech recognition engine are aimed mainly at the market for intelligent applications such as inquiry machines, video conference communications, control of moving objects in noisy environments, etc.

Detail of publication

Title: Audio-visual speech recognition for Slavonic languages (Czech and Russian)
Author: Císař, P. ; Zelinka, J. ; Železný, M. ; Karpov, A. ; Ronzhin, A.
Language: English
Date of publication: 25 Jun 2006
Year: 2006
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Proceedings of the 11th international conference "Speech and computer" SPECOM'2006
Page: 493 - 498
ISBN: 5-7452-0074-X
Publisher: Anatolya Publishers
Address: St.Petersburg
Date: 25 Jun 2006 - 29 Jun 2006

Keywords

audio-visual speech recognition, Slavonic languages

BibTeX

@INPROCEEDINGS{CisarP_2006_Audio-visualspeech,
 author = {C\'{i}sa\v{r}, P. and Zelinka, J. and \v{Z}elezn\'{y}, M. and Karpov, A. and Ronzhin, A.},
 title = {Audio-visual speech recognition for Slavonic languages (Czech and Russian)},
 year = {2006},
 publisher = {Anatolya Publishers},
 booktitle = {Proceedings of the 11th international conference "Speech and computer" SPECOM'2006},
 address = {St. Petersburg},
 pages = {493-498},
 ISBN = {5-7452-0074-X},
 url = {http://www.kky.zcu.cz/en/publications/CisarP_2006_Audio-visualspeech},
}