Skip to content

Detail of publication

Citation

Byrne, W. and Doerman, D. and Franz, M. and Gustman, S. and Hajič, J. and Oard, D. and Picheny, M. and Psutka, J. and Ramabhadran, B. and Soergel, D. and Ward, T. and Zhu, W. : Automatic recognition of spontaneous speech for access to multilingual oral history archives . IEEE transactions on speech and audio processing, vol. 4, p. 420-435, 2004.

Abstract

The paper presents initial results from experiments with speech recognition, topic segmentation, topic categorization, and named entity detection using a large collection of recorded oral histories. The work leverages a massive manual annotation effort on 10 000 h of spontaneous speech to evaluate the degree to which automatic speech recognition (ASR)-based segmentation and categorization techniques can be adapted to approximate decisions made by human annotators. ASR word error rates near 40% were achieved for both English and Czech for heavily accented, emotional and elderly spontaneous speech based on 65-84 h of transcribed speech. Topical segmentation based on shifts in the recognized English vocabulary resulted in 80% agreement with manually annotated boundary positions at a 0.35 false alarm rate. Categorization was considerably more challenging, with a nearestneighbor technique yielding F = 03.

Detail of publication

Title: Automatic recognition of spontaneous speech for access to multilingual oral history archives
Author: Byrne, W. ; Doerman, D. ; Franz, M. ; Gustman, S. ; Hajič, J. ; Oard, D. ; Picheny, M. ; Psutka, J. ; Ramabhadran, B. ; Soergel, D. ; Ward, T. ; Zhu, W.
Language: English
Date of publication: 1 Jan 2004
Year: 2004
Type of publication: Papers in journals
Title of journal or book: IEEE transactions on speech and audio processing
Číslo vydání: 4
Page: 420 - 435
ISBN: 1063-6676
/ /

Keywords

Automatic speech recognition (ASR), information retrieval, multilingual ASR, oral history, spoken document retrieval, spontaneous speech.

BibTeX

@ARTICLE{ByrneW_2004_Automaticrecognition,
 author = {Byrne, W. and Doerman, D. and Franz, M. and Gustman, S. and Haji\v{c}, J. and Oard, D. and Picheny, M. and Psutka, J. and Ramabhadran, B. and Soergel, D. and Ward, T. and Zhu, W.},
 title = {Automatic recognition of spontaneous speech for access to multilingual oral history archives},
 year = {2004},
 journal = {IEEE transactions on speech and audio processing},
 volume = {4},
 pages = {420-435},
 ISBN = {1063-6676},
 url = {http://www.kky.zcu.cz/en/publications/ByrneW_2004_Automaticrecognition},
}