Detail of publication
Citation
Císař, P.; Zelinka, J.; Železný, M.; Karpov, A.; Ronzhin, A.: Audio-visual speech recognition for Slavonic languages (Czech and Russian). Proceedings of the 11th international conference "Speech and computer" SPECOM'2006, p. 493-498, Anatolya Publishers, St.Petersburg, 2006.
Abstract
The paper presents the results of recent experiments with audio-visual speech recognition for two popular Slavonic languages: Russian and Czech. It describes the applied test tasks, the collection of multimodal databases and data pre-processing, methods for visual feature extraction (geometric shape-based features; DCT and PCA pixel-based visual parameterization), and models of audio-visual recognition (concatenation of feature vectors and multi-stream models). The prototypes of applied systems that will use the audio-visual speech recognition engine are aimed mainly at the market of intelligent applications such as inquiry machines, video conference communications, control of moving objects in noisy environments, etc.
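The two fusion schemes named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 8x8 region of interest, the number of retained DCT coefficients, and the stream weight of 0.7 are all assumptions chosen only for illustration. It shows a pixel-based DCT parameterization of a mouth region, early fusion by concatenating audio and visual feature vectors, and a multi-stream style combination that weights per-stream log-likelihoods.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis as an (n, n) matrix."""
    k = np.arange(n)[:, None]   # frequency index (rows)
    i = np.arange(n)[None, :]   # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)  # DC row has its own normalization
    return m

def dct_visual_features(roi, keep=4):
    """2-D DCT of a square ROI; keep the low-frequency keep x keep block.

    `roi` is a hypothetical grayscale mouth region; `keep` is an assumed value.
    """
    d = dct_matrix(roi.shape[0])
    coeffs = d @ roi @ d.T
    return coeffs[:keep, :keep].ravel()

def concatenate_features(audio_vec, visual_vec):
    """Early fusion: one joint observation vector per frame."""
    return np.concatenate([audio_vec, visual_vec])

def multistream_score(audio_loglik, visual_loglik, audio_weight=0.7):
    """Weighted sum of per-stream log-likelihoods; the weights sum to 1.

    The default weight is an assumption, not a value from the paper.
    """
    return audio_weight * audio_loglik + (1.0 - audio_weight) * visual_loglik

if __name__ == "__main__":
    roi = np.ones((8, 8))        # placeholder image data
    visual = dct_visual_features(roi)
    audio = np.zeros(12)         # placeholder acoustic feature vector
    print(concatenate_features(audio, visual).shape)
    print(multistream_score(-10.0, -20.0))
```

With concatenation a single model sees the joint vector, whereas the multi-stream variant keeps the streams separate so that the audio weight can be lowered when the acoustic channel is noisy.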
Detail of publication
Title: | Audio-visual speech recognition for Slavonic languages (Czech and Russian) |
---|---|
Author: | Císař, P. ; Zelinka, J. ; Železný, M. ; Karpov, A. ; Ronzhin, A. |
Language: | English |
Date of publication: | 25 Jun 2006 |
Year: | 2006 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | Proceedings of the 11th international conference "Speech and computer" SPECOM'2006 |
Page: | 493 - 498 |
ISBN: | 5-7452-0074-X |
Publisher: | Anatolya Publishers |
Address: | St.Petersburg |
Date: | 25 Jun 2006 - 29 Jun 2006 |
Keywords
audio-visual speech recognition, Slavonic languages
BibTeX
@INPROCEEDINGS{CisarP_2006_Audio-visualspeech,
  author = {C\'{i}sa\v{r}, P. and Zelinka, J. and \v{Z}elezn\'{y}, M. and Karpov, A. and Ronzhin, A.},
  title = {Audio-visual speech recognition for Slavonic languages (Czech and Russian)},
  year = {2006},
  publisher = {Anatolya Publishers},
  booktitle = {Proceedings of the 11th international conference "Speech and computer" SPECOM'2006},
  address = {St.Petersburg},
  pages = {493-498},
  ISBN = {5-7452-0074-X},
  url = {http://www.kky.zcu.cz/en/publications/CisarP_2006_Audio-visualspeech}
}