Publications
Detail of publication
Citation
p. 1577-1580, ISCA, Geneva, 2003. : The Czech speech and prosody database both for ASR and TTS purposes . EUROSPEECH 2003 PROCEEDINGS,
Download PDF
Abstract
This paper describes a preparation of the first large Czech prosodic database which should be useful both in automatic speech recognition (ASR) and text-to-speech (TTS) synthesis. In the area of ASR we intend to use it for an automatic punctuation annotation, in the area of TTS for building a prosodic module for the Czech high-quality synthesis. The database is based on the Czech Radio&TV Broadcast News Corpus (UWB_B02) recorded at the University of West Bohemia. The configuration of the database includes recorded speech, raw and stylized F0 values, frame level energy values, a word- and phoneme-level time alignment, and a linguistically motivated description of the prosodic data. A technique of prosodic data acquisition and stylization is described. A new tagset for a linguistical annotation of the Czech prosody is proposed and used.
Detail of publication
Title: | The Czech speech and prosody database both for ASR and TTS purposes |
---|---|
Author: | Kolář, J. ; Romportl, J. ; Psutka, J. |
Language: | English |
Date of publication: | 1 Sep 2003 |
Year: | 2003 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | EUROSPEECH 2003 PROCEEDINGS |
Page: | 1577 - 1580 |
Publisher: | ISCA |
Address: | Geneva |
Date: | 1 Sep 2003 - 4 Sep 2003 |
Keywords
prosody, automatic punctuation, automatic speech recognition, TTS
BibTeX
@INPROCEEDINGS{KolarJ_2003_TheCzechspeechand, author = {Kol\'{a}\v{r}, J. and Romportl, J. and Psutka, J.}, title = {The Czech speech and prosody database both for ASR and TTS purposes}, year = {2003}, publisher = {ISCA}, journal = {EUROSPEECH 2003 PROCEEDINGS}, address = {Geneva}, pages = {1577-1580}, url = {http://www.kky.zcu.cz/en/publications/KolarJ_2003_TheCzechspeechand}, }