Publications
Detail of publication
Citation
Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins Univ. v Baltimore, Shoah Visual History Foundation, 2005. : Russian Malach Speech Corpus .
Abstract
Visual History Foundation collected recently at least 52 thousand testimonies of holocaust survivors pronounced at 32 different languages. The Russian collection is created by about 7050 testimonies with the total length of about 16,000 hours. The corresponding Russian Malach Speech Corpus was annotated with the goal to build the large vocabulary continuous speech recognition system. For this purpose it was selected and manually transcribed 400 15-minute speech segments of individual speakers (for training purposes) and whole testimonies of 10 different survivors (about 25 hours of speech) for tests. All manual annotations were performed in the orthographic form of the words.
Detail of publication
Title: | Russian Malach Speech Corpus |
---|---|
Author: | Psutka, J. ; Psutka Josef V. ; Müller, L. ; Matoušek, J. ; Radová, V. ; Ircing, P. |
Language: | English |
Date of publication: | 1 Jan 2005 |
Year: | 2005 |
Type of publication: | Prototype, software |
Publisher: | Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins Univ. v Baltimore, Shoah Visual History Foundation |
Keywords
Russian spontaneous speech corpus, large vocabulary continuous speech recognition
BibTeX
@MISC{PsutkaJ_2005_RussianMalachSpeech, author = {Psutka, J. and Psutka Josef V. and M\"{u}ller, L. and Matou\v{s}ek, J. and Radov\'{a}, V. and Ircing, P.}, title = {Russian Malach Speech Corpus}, year = {2005}, publisher = {Katedra kybernetiky, Fakulta aplikovan\'{y}ch v\v{e}d, Z\'{a}pado\v{c}esk\'{a} univerzita v Plzni, Johns Hopkins Univ. v Baltimore, Shoah Visual History Foundation}, url = {http://www.kky.zcu.cz/en/publications/PsutkaJ_2005_RussianMalachSpeech}, }