Detail of publication
Citation
Méner Martin, Tihelka Daniel: The possibilities of time scale modification of speech. Speech Processing, p. 107-113, Institute of Photonics and Electronics AS CR, Praha, 2009.
Abstract
This paper deals with time-scale modification of speech. The aim is to integrate a high-quality method for stretching and compressing speech signals into the ELJABR project, specifically into the part of the project that generates audio tracks automatically. The audio track is produced from subtitles by a TTS (text-to-speech) system, and one of the most frequent problems is that the synthetic speech exceeds the time slot in which the subtitles are displayed. The length of the synthetic speech must therefore be synchronized with the subtitle time slot, mostly by increasing the speech rate, while preserving speech quality. This issue can be addressed with the WSOLA (waveform similarity overlap-add) technique.
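To illustrate the idea in the abstract, the following is a minimal sketch of WSOLA-style time-scale modification in plain Python. It is not the implementation from the paper: the function names `required_speed` and `wsola`, the frame length, the search tolerance, and the use of unnormalized cross-correlation as the similarity measure are all assumptions chosen for brevity.

```python
import math


def required_speed(speech_s, slot_s):
    """Speed-up factor so that speech of length speech_s fits into slot_s.

    Hypothetical helper: never slows the speech down (factor >= 1.0).
    """
    return max(1.0, speech_s / slot_s)


def wsola(x, speed, frame=200, tol=50):
    """Time-scale the sample list x by `speed` (> 1.0 means faster).

    Assumed parameters: `frame` is the (even) window length in samples,
    `tol` is the search range around each nominal input position.
    """
    hs = frame // 2                 # synthesis hop (50 % overlap)
    ha = hs * speed                 # analysis hop in the input signal
    # Periodic Hann window: overlapping halves sum to 1 at 50 % overlap.
    win = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / frame)
           for n in range(frame)]
    # Zero-pad so that every slice below stays inside the list.
    xp = list(x) + [0.0] * (2 * frame + tol)
    n_frames = max(1, int(len(x) / ha))
    y = [0.0] * ((n_frames - 1) * hs + frame)

    prev = 0                        # input position of the previous frame
    for k in range(n_frames):
        best = 0
        if k > 0:
            nominal = int(round(k * ha))
            # Natural continuation of the previously copied segment.
            target = xp[prev + hs:prev + hs + frame]
            best_score = -math.inf
            for cand in range(max(0, nominal - tol), nominal + tol + 1):
                seg = xp[cand:cand + frame]
                score = sum(a * b for a, b in zip(target, seg))
                if score > best_score:
                    best, best_score = cand, score
        # Overlap-add the best-matching segment at the output position.
        out = k * hs
        for n in range(frame):
            y[out + n] += win[n] * xp[best + n]
        prev = best
    return y
```

In a subtitle scenario one might call `wsola(samples, required_speed(speech_s, slot_s))`, so that the speech is compressed only when it overruns its slot. The pure-Python search loop is slow for long signals; it is meant to show the principle, not to be used as is.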
Detail of publication
Title: | The possibilities of time scale modification of speech |
---|---|
Author: | Méner Martin ; Tihelka Daniel |
Language: | English |
Year: | 2009 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | Speech Processing |
Page: | 107 - 113 |
ISBN: | 978-80-86269-18-4 |
Publisher: | Institute of Photonics and Electronics AS CR |
Address: | Praha |
Date: | 1 Oct 2009 |
Keywords
WSOLA, text-to-speech, subtitles, ELJABR
BibTeX
@INPROCEEDINGS{MenerMartin_2009_Thepossibilitiesof,
  author = {M\'{e}ner, Martin and Tihelka, Daniel},
  title = {The possibilities of time scale modification of speech},
  year = {2009},
  publisher = {Institute of Photonics and Electronics AS CR},
  booktitle = {Speech Processing},
  address = {Praha},
  pages = {107--113},
  ISBN = {978-80-86269-18-4},
  url = {http://www.kky.zcu.cz/en/publications/MenerMartin_2009_Thepossibilitiesof},
}