Detail of publication
Citation
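Byrne, W. ; Doerman, D. ; Franz, M. ; Gustman, S. ; Hajič, J. ; Oard, D. ; Picheny, M. ; Psutka, J. ; Ramabhadran, B. ; Soergel, D. ; Ward, T. ; Zhu, W. : Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE transactions on speech and audio processing, no. 4, p. 420-435, 2004.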
Abstract
The paper presents initial results from experiments with speech recognition, topic segmentation, topic categorization, and named entity detection using a large collection of recorded oral histories. The work leverages a massive manual annotation effort on 10 000 h of spontaneous speech to evaluate the degree to which automatic speech recognition (ASR)-based segmentation and categorization techniques can be adapted to approximate decisions made by human annotators. ASR word error rates near 40% were achieved for both English and Czech for heavily accented, emotional, and elderly spontaneous speech, based on 65-84 h of transcribed speech. Topical segmentation based on shifts in the recognized English vocabulary resulted in 80% agreement with manually annotated boundary positions at a 0.35 false alarm rate. Categorization was considerably more challenging, with a nearest-neighbor technique yielding F = 0.3.
Title: | Automatic recognition of spontaneous speech for access to multilingual oral history archives |
---|---|
Author: | Byrne, W. ; Doerman, D. ; Franz, M. ; Gustman, S. ; Hajič, J. ; Oard, D. ; Picheny, M. ; Psutka, J. ; Ramabhadran, B. ; Soergel, D. ; Ward, T. ; Zhu, W. |
Language: | English |
Date of publication: | 1 Jan 2004 |
Year: | 2004 |
Type of publication: | Papers in journals |
Title of journal or book: | IEEE transactions on speech and audio processing |
Issue number: | 4 |
Page: | 420 - 435 |
ISSN: | 1063-6676 |
Keywords
Automatic speech recognition (ASR), information retrieval, multilingual ASR, oral history, spoken document retrieval, spontaneous speech.
BibTeX
@ARTICLE{ByrneW_2004_Automaticrecognition,
  author = {Byrne, W. and Doerman, D. and Franz, M. and Gustman, S. and Haji\v{c}, J. and Oard, D. and Picheny, M. and Psutka, J. and Ramabhadran, B. and Soergel, D. and Ward, T. and Zhu, W.},
  title = {Automatic recognition of spontaneous speech for access to multilingual oral history archives},
  year = {2004},
  journal = {IEEE transactions on speech and audio processing},
  number = {4},
  pages = {420--435},
  ISSN = {1063-6676},
  url = {http://www.kky.zcu.cz/en/publications/ByrneW_2004_Automaticrecognition},
}