Přejít na obsah

Detail publikace

Citace

Železný, M. and Císař, P. : Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition . Proceedings of AVSP 2003, , p. 169-173, Université Stendhal, Grenoble, 2003.

Abstrakt

This paper presents the design of an audio-visual speech corpus for in-vehicle audio-visual speech recognition. Throughout the world, there exist several audio-visual speech corpora. There are also several (audio-only) speech corpora for in-vehicle recognition. So far, we have not found an audio-visual speech corpus for in-vehicle speech recognition. And, we have not found any audio-visual speech corpora for the Czech language either. Since our aim is to design an audio-visual speech recognizer for in-vehicle recognition, the first thing we had to do was to design, collect, and process the Czech in-vehicle audio-visual speech corpora. The purpose of in-vehicle speech recognition is usually its utilization for command control of car features, which does not involve driver's hands. Thus, in real deployment, it will be the driver, whose speech will be recognized. We decided to collect the driver's speech for training purposes.

Detail publikace

Název: Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition
Autor: Železný, M. ; Císař, P.
Jazyk publikace: anglicky
Datum vydání: 4.9.2003
Rok vydání: 2003
Typ publikace: Stať ve sborníku
Název časopisu / knihy: Proceedings of AVSP 2003
Edice:
Strana: 169 - 173
Nakladatel: Université Stendhal
Místo vydání: Grenoble
Datum: 4.9.2003 - 7.9.2003
/ /

Klíčová slova

audio-visual speech recognition, speech corpora, audio-visual speech processing, speech recognition

Klíčová slova v češtině

audiovizuální rozpoznávání řeči, řečové korpusy, audiovizuální zpracování řeči, rozpoznávání řeči

BibTeX

@INPROCEEDINGS{ZeleznyM_2003_Czechaudio-visual_1,
 author = {\v{Z}elezn\'{y}, M. and C\'{i}sa\v{r}, P.},
 title = {Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition},
 year = {2003},
 publisher = {Universit\'{e} Stendhal},
 journal = {Proceedings of AVSP 2003},
 address = {Grenoble},
 pages = {169-173},
 url = {http://www.kky.zcu.cz/en/publications/ZeleznyM_2003_Czechaudio-visual_1},
}