Publications
Publication details
Citation
Železný, M.; Císař, P.: Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition. Proceedings of AVSP 2003, p. 169-173, Université Stendhal, Grenoble, 2003.
Abstract
This paper presents the design of an audio-visual speech corpus for in-vehicle audio-visual speech recognition. Several audio-visual speech corpora exist worldwide, as do several audio-only speech corpora for in-vehicle recognition. So far, however, we have found neither an audio-visual speech corpus for in-vehicle speech recognition nor any audio-visual speech corpus for the Czech language. Since our aim is to build an audio-visual speech recognizer for in-vehicle use, our first step was to design, collect, and process a Czech in-vehicle audio-visual speech corpus. In-vehicle speech recognition is typically used for hands-free command control of car features. In real deployment, it is therefore the driver whose speech will be recognized, so we decided to collect the driver's speech for training purposes.
Publication details
Title: | Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition |
---|---|
Authors: | Železný, M. ; Císař, P. |
Publication language: | English |
Publication date: | 4 September 2003 |
Year of publication: | 2003 |
Publication type: | Paper in conference proceedings |
Journal / book title: | Proceedings of AVSP 2003 |
Edition: | |
Pages: | 169 - 173 |
Publisher: | Université Stendhal |
Place of publication: | Grenoble |
Date: | 4 September 2003 - 7 September 2003 |
Keywords
audio-visual speech recognition, speech corpora, audio-visual speech processing, speech recognition
Keywords in Czech
audiovizuální rozpoznávání řeči, řečové korpusy, audiovizuální zpracování řeči, rozpoznávání řeči
BibTeX
@INPROCEEDINGS{ZeleznyM_2003_Czechaudio-visual_1,
  author = {\v{Z}elezn\'{y}, M. and C\'{i}sa\v{r}, P.},
  title = {Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition},
  year = {2003},
  publisher = {Universit\'{e} Stendhal},
  booktitle = {Proceedings of AVSP 2003},
  address = {Grenoble},
  pages = {169--173},
  url = {http://www.kky.zcu.cz/en/publications/ZeleznyM_2003_Czechaudio-visual_1},
}