Detail of publication
Citation
Pražák, A.; Müller, L.; Psutka, Josef V.; Psutka, J.: LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling. SIGMAP 2007, p. 139-142, INSTICC PRESS, Lisbon, 2007.
Abstract
The paper describes a fast 2-pass large vocabulary continuous speech recognition (LVCSR) system for automatic online subtitling of live TV programs. The proposed system implementation can be used either for direct recognition of the TV program's audio channel or for recognition of a shadow speaker who re-speaks the original audio channel. The first part of the paper focuses on the preparation of an adaptive language model for TV programs, where person names are specific to each subtitling session and have to be added to the recognition vocabulary. The second part outlines the design of the recognition system for automatic online subtitling in real time with a vocabulary of up to 150 000 words. The recognition system is based on Hidden Markov Models, lexical trees, and bigram and quadgram language models in the first and second pass, respectively. Finally, experimental results from our project with Czech Television are reported and discussed.
Detail of publication
Title: | LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling |
---|---|
Author: | Pražák, A. ; Müller, L. ; Psutka, Josef V. ; Psutka, J. |
Language: | English |
Date of publication: | 28 Jul 2007 |
Year: | 2007 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | SIGMAP 2007 |
Edition: | |
Page: | 139 - 142 |
ISBN: | 978-989-8111-13-5 |
Publisher: | INSTICC PRESS |
Address: | Lisbon |
Date: | 28 Jul 2007 - 31 Jul 2007 |
Keywords
ASR, LVCSR, HMM, real-time, class-based language model, live TV, online subtitling
BibTeX
@INPROCEEDINGS{PrazakA_2007_LIVETVSUBTITLING,
  author = {Pra\v{z}\'{a}k, A. and M\"{u}ller, L. and Psutka, Josef V. and Psutka, J.},
  title = {LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling},
  year = {2007},
  publisher = {INSTICC PRESS},
  booktitle = {SIGMAP 2007},
  address = {Lisbon},
  pages = {139-142},
  ISBN = {978-989-8111-13-5},
  url = {http://www.kky.zcu.cz/en/publications/PrazakA_2007_LIVETVSUBTITLING},
}