Detail of publication
Citation
Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati, 2005. Russian Spontaneous Speech – Acoustic & Language Models (MALACH).
Abstract
The Visual History Foundation recently collected about 52 thousand testimonies of Holocaust survivors, given in 32 languages. Approximately 7,050 of them are Russian testimonies, with a total length of 16,000 hours. Transcribing all of these testimonies manually is not feasible because of the enormous time and cost involved. The transcription is therefore performed by an automatic speech recognition system; the data for developing the system were taken from the Russian Malach Speech Corpus. The basic acoustic model (AM) unit is a triphone represented by a 5-state HMM, in which every state is modeled as a GMM with 16 mixture components. A phonetic clustering tree reduced the total number of states to 6969. The language model is designed as a combination of two bigram models.
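The abstract does not say how the two bigram models are combined. One common choice is linear interpolation, and the sketch below illustrates it under that assumption. The toy English sentences, the add-one smoothing, the weight `lam`, and all function names are hypothetical and are not taken from the MALACH system.

```python
from collections import Counter


def bigram_counts(sentences):
    """Count bigram histories and bigrams, adding sentence-boundary markers."""
    history, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, word in zip(tokens, tokens[1:]):
            history[prev] += 1
            bigrams[(prev, word)] += 1
    return history, bigrams


def bigram_prob(model, prev, word, vocab_size):
    """Add-one smoothed estimate of P(word | prev) for a single bigram model."""
    history, bigrams = model
    return (bigrams[(prev, word)] + 1) / (history[prev] + vocab_size)


def interpolated_prob(model_a, model_b, prev, word, vocab_size, lam=0.5):
    """Linear interpolation: lam * P_a(word | prev) + (1 - lam) * P_b(word | prev)."""
    p_a = bigram_prob(model_a, prev, word, vocab_size)
    p_b = bigram_prob(model_b, prev, word, vocab_size)
    return lam * p_a + (1 - lam) * p_b


# Hypothetical toy corpora standing in for two different text sources.
corpus_a = ["we lived in the village", "we left the village"]
corpus_b = ["the village was small"]

# Shared vocabulary of predictable tokens: every word plus the end marker.
vocab = {w for s in corpus_a + corpus_b for w in s.split()} | {"</s>"}

model_a = bigram_counts(corpus_a)
model_b = bigram_counts(corpus_b)

p = interpolated_prob(model_a, model_b, "the", "village", len(vocab), lam=0.7)
print(f"P(village | the) = {p:.4f}")
```

Because both component models are smoothed over the same vocabulary, the interpolated probabilities for a given history still sum to one. In practice the weight would typically be tuned on held-out data.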
Detail of publication
Title: | Russian Spontaneous Speech – Acoustic & Language Models (MALACH) |
---|---|
Author: | Ircing, P.; Psutka, J.; Psutka Josef V. |
Language: | English |
Date of publication: | 1 Jan 2005 |
Year: | 2005 |
Type of publication: | Prototype, software |
Publisher: | Katedra kybernetiky, Fakulta aplikovaných věd, Západočeská univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati |
Keywords
Russian acoustic model, Russian language model, Speech recognition
BibTeX
@MISC{IrcingP_2005_RussianSpontaneaous,
  author = {Ircing, P. and Psutka, J. and Psutka Josef V.},
  title = {Russian Spontaneous Speech - Acoustic \& Language Models (MALACH)},
  year = {2005},
  publisher = {Katedra kybernetiky, Fakulta aplikovan\'{y}ch v\v{e}d, Z\'{a}pado\v{c}esk\'{a} univerzita v Plzni, Johns Hopkins University v Baltimore, Shoah Visual History Foundati},
  url = {http://www.kky.zcu.cz/en/publications/IrcingP_2005_RussianSpontaneaous},
}