Detail of publication
Citation
Císař, P.; Železný, M.; Krňoul, Z.; Kanis, J.; Zelinka, J.; Müller, L.: Design and recording of Czech speech corpus for audio-visual continuous speech recognition. Proceedings of the Auditory-Visual Speech Processing International Conference 2005, p. 1-4, AVSP2005, Vancouver Island, 2005.
Abstract
In this paper we describe the design, recording, and content of a large audio-visual speech database intended for training and testing audio-visual continuous speech recognition systems. The UWB-05-HSCAVC database contains high-resolution video and quality audio data suitable for experiments on audio-visual speech recognition. The corpus consists of nearly 40 hours of audio-visual recordings of 100 speakers in laboratory conditions. The whole database was collected using static illumination. Recorded subjects were asked to remain static, with almost no head movements. The whole corpus is annotated and pre-processed so that it is ready to use in audio-visual speech recognition experiments. The purpose of the corpus is to provide data for the evaluation of visual speech parameterizations. The corpus pre-processing was designed for use with both image-based and contour-based visual speech parameterizations.
Title: | Design and recording of Czech speech corpus for audio-visual continuous speech recognition |
---|---|
Author: | Císař, P. ; Železný, M. ; Krňoul, Z. ; Kanis, J. ; Zelinka, J. ; Müller, L. |
Language: | English |
Date of publication: | 24 Jul 2005 |
Year: | 2005 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | Proceedings of the Auditory-Visual Speech Processing International Conference 2005 |
Page: | 1 - 4 |
ISBN: | 1 876346 53 1 |
Publisher: | AVSP2005 |
Address: | Vancouver Island |
Date: | 24 Jul 2005 - 27 Jul 2005 |
Keywords
Czech corpus, audio-visual, speech recognition, lipreading, speech reading
BibTeX
@INPROCEEDINGS{CisarP_2005_Designandrecording,
  author    = {C\'{i}sa\v{r}, P. and \v{Z}elezn\'{y}, M. and Kr\v{n}oul, Z. and Kanis, J. and Zelinka, J. and M\"{u}ller, L.},
  title     = {Design and recording of Czech speech corpus for audio-visual continuous speech recognition},
  year      = {2005},
  publisher = {AVSP2005},
  booktitle = {Proceedings of the Auditory-Visual Speech Processing International Conference 2005},
  address   = {Vancouver Island},
  pages     = {1-4},
  ISBN      = {1 876346 53 1},
  url       = {http://www.kky.zcu.cz/en/publications/CisarP_2005_Designandrecording}
}