Skip to content

Detail of publication

Citation

Psutka, J. and Psutka, J. and Radová, V. and Ircing, P. and Matoušek, J. and Müller, L. : Czech Malach Speech Corpus . Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati, 2003.

Abstract

Visual History Foundation collected recently at least 52 thousand testimonies of holocaust survivors pronounced at 32 different languages. The Czech collection is created by about 570 testimonies with the total length of about 1,200 hours. The corresponding Czech Malach Speech Corpus was annotated with the goal to build the large vocabulary continuous speech recognition system. For this purpose it was selected and manually transcribed 336 15-minute speech segments of individual speakers (for training purposes) and whole testimonies of 10 different survivors (about 20 hours of speech) for tests. All manual annotations were performed in the orthographic form of the words. This means that the eventual colloquial words were neither transformed to standard (formal, non-colloquial) forms nor written phonetically. Czech colloquial words are usually not considered to be phonetic variants of standard Czech words therefore they are written in their colloquial orthographic form.

Detail of publication

Title: Czech Malach Speech Corpus
Author: Psutka, J. ; Psutka, J. ; Radová, V. ; Ircing, P. ; Matoušek, J. ; Müller, L.
Language: English
Date of publication: 1 Jan 2003
Year: 2003
Type of publication: Prototype, software
Publisher: Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati
/ /

Keywords

spontaneous speech corpus, large vocabulary continuous speech recognition

BibTeX

@MISC{PsutkaJ_2003_CzechMalachSpeech,
 author = {Psutka, J. and Psutka, J. and Radov\'{a}, V. and Ircing, P. and Matou\v{s}ek, J. and M\"{u}ller, L.},
 title = {Czech Malach Speech Corpus},
 year = {2003},
 publisher = {Katedra kybernetiky, Fakulta aplikovan\'{y}ch v\v{e}d, Z\'{a}pado\v{c}esk\'{a} univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati},
 url = {http://www.kky.zcu.cz/en/publications/PsutkaJ_2003_CzechMalachSpeech},
}