Skip to content

Detail of publication

Citation

Císař, P. and Železný, M. and Krňoul, Z. and Kanis, J. and Zelinka, J. and Müller, L. : Design and recording of Czech speech corpus for audio-visual continuous speech recognition . Proceedings of the Auditory-Visual Speech Processing International Conference 2005, p. 1-4, AVSP2005, Vancouver Island, 2005.

Abstract

In this paper we describe the design, recording, and content of a large audio-visual speech database intended for training and testing of audio-visual continuous speech recognition systems. The UWB- 05-HSCAVC database contains high resolution video and quality audio data suitable for experiments on audio-visual speech recognition. The corpus consists of nearly 40 hours of audiovisual records of 100 speakers in laboratory conditions. The whole database was collected using static illumination. Recorded subjects were asked to remain static with almost no head movements. The whole corpus is annotated and pre-processed to be ready to use in audio-visual speech recognition experiments. The purpose of the corpus is to provide data for evaluation of visual speech parameterizations. The corpus pre-processing was designed for use with both image-based and contour-based visual speech parameterizations.

Detail of publication

Title: Design and recording of Czech speech corpus for audio-visual continuous speech recognition
Author: Císař, P. ; Železný, M. ; Krňoul, Z. ; Kanis, J. ; Zelinka, J. ; Müller, L.
Language: English
Date of publication: 24 Jul 2005
Year: 2005
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Proceedings of the Auditory-Visual Speech Processing International Conference 2005
Page: 1 - 4
ISBN: 1 876346 53 1
Publisher: AVSP2005
Address: Vancouver Island
Date: 24 Jul 2005 - 27 Jul 2005
/ /

Keywords

czech corpus, audio-visual, speech recognition, lipreading, speech reading

BibTeX

@INPROCEEDINGS{CisarP_2005_Designandrecording,
 author = {C\'{i}sa\v{r}, P. and \v{Z}elezn\'{y}, M. and Kr\v{n}oul, Z. and Kanis, J. and Zelinka, J. and M\"{u}ller, L.},
 title = {Design and recording of Czech speech corpus for audio-visual continuous speech recognition},
 year = {2005},
 publisher = {AVSP2005},
 journal = {Proceedings of the Auditory-Visual Speech Processing International Conference 2005},
 address = {Vancouver Island},
 pages = {1-4},
 ISBN = {1 876346 53 1},
 url = {http://www.kky.zcu.cz/en/publications/CisarP_2005_Designandrecording},
}