Skip to content

Detail of publication

Citation

Psutka, J. and Psutka Josef V. and Radová, V. and Ircing, P. and Matoušek, J. and Müller, L. : Polish Malach Speech Corpus . Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins Univ. v Baltimore, Shoah Visual History Foundation, 2006.

Abstract

Visual History Foundation collected recently at least 52 thousand testimonies of holocaust survivors pronounced at 32 different languages. The Polish collection is created by about 1,550 testimonies with the total length of about 3,500 hours. The corresponding Polish Malach Speech Corpus was annotated with the goal to build the large vocabulary continuous speech recognition system. For this purpose it was selected and manually transcribed 200 15-minute speech segments of individual speakers (for training purposes) and whole testimonies of 10 different survivors (about 22 hours of speech) for tests. All manual annotations were performed in the orthographic form of the words.

Detail of publication

Title: Polish Malach Speech Corpus
Author: Psutka, J. ; Psutka Josef V. ; Radová, V. ; Ircing, P. ; Matoušek, J. ; Müller, L.
Language: English
Date of publication: 1 Jan 2006
Year: 2006
Type of publication: Prototype, software
Publisher: Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins Univ. v Baltimore, Shoah Visual History Foundation
/ 2011-06-09 12:53:22 /

Keywords

Polish spontaneous speech corpus, large vocabulary continuous speech recognition

BibTeX

@MISC{PsutkaJ_2006_PolishMalachSpeech,
 author = {Psutka, J. and Psutka Josef V. and Radov\'{a}, V. and Ircing, P. and Matou\v{s}ek, J. and M\"{u}ller, L.},
 title = {Polish Malach Speech Corpus},
 year = {2006},
 publisher = {Katedra kybernetiky, Fakulta aplikovan\'{y}ch v\v{e}d, Z\'{a}pado\v{c}esk\'{a} univerzita v Plzni, Johns Hopkins Univ.  v Baltimore, Shoah Visual History Foundation},
 url = {http://www.kky.zcu.cz/en/publications/PsutkaJ_2006_PolishMalachSpeech},
}