Publications
Detail of publication
Citation
Císař, P.; Zelinka, J.; Železný, M.; Karpov, A.; Ronzhin, A.: Audio-visual speech recognition for Slavonic languages (Czech and Russian). Proceedings of the 11th international conference "Speech and computer" SPECOM'2006, p. 493-498, Anatolya Publishers, St. Petersburg, 2006.
Abstract
The paper presents the results of recent experiments with audio-visual speech recognition for two popular Slavonic languages: Russian and Czech. It describes the applied test tasks, the collection of multimodal databases and data pre-processing, methods for visual feature extraction (geometric shape-based features; DCT and PCA pixel-based visual parameterization), and models of audio-visual recognition (concatenation of feature vectors and multi-stream models). The prototypes of applied systems that will use the audio-visual speech recognition engine are aimed mainly at the market for intelligent applications such as inquiry machines, video conferencing, control of moving objects in noisy environments, etc.
Detail of publication
| Title: | Audio-visual speech recognition for Slavonic languages (Czech and Russian) |
|---|---|
| Author: | Císař, P. ; Zelinka, J. ; Železný, M. ; Karpov, A. ; Ronzhin, A. |
| Language: | English |
| Date of publication: | 25 Jun 2006 |
| Year: | 2006 |
| Type of publication: | Papers in proceedings of reviewed conferences |
| Title of journal or book: | Proceedings of the 11th international conference "Speech and computer" SPECOM'2006 |
| Page: | 493 - 498 |
| ISBN: | 5-7452-0074-X |
| Publisher: | Anatolya Publishers |
| Address: | St. Petersburg |
| Date: | 25 Jun 2006 - 29 Jun 2006 |
Keywords
audio-visual speech recognition, Slavonic languages
BibTeX
@INPROCEEDINGS{CisarP_2006_Audio-visualspeech,
author = {C\'{i}sa\v{r}, P. and Zelinka, J. and \v{Z}elezn\'{y}, M. and Karpov, A. and Ronzhin, A.},
title = {Audio-visual speech recognition for Slavonic languages (Czech and Russian)},
year = {2006},
publisher = {Anatolya Publishers},
booktitle = {Proceedings of the 11th international conference "Speech and computer" SPECOM'2006},
address = {St.Petersburg},
pages = {493--498},
ISBN = {5-7452-0074-X},
url = {http://www.kky.zcu.cz/en/publications/CisarP_2006_Audio-visualspeech},
}


