Skip to content

Detail of publication

Citation

Méner Martin and Tihelka Daniel : The possibilities of time scale modification of speech . Speech Processing, p. 107-113, Institute of Photonics and Electronics AS CR, Praha, 2009.

Abstract

The present paper deals with time-scale modifications of speech. The aim is to utilize high quality speech signal stretching method into the project ELJABR, or more specifically, into the part of project including automatic audio track generation. The audio track is produced from subtitles by TTS (text-to-speech) system, and one of the most frequented problems is that synthetic speech often exceeds the time slot in which subtitles are displayed. Therefore, the length of synthetic speech and time-slot of subtitles must be synchronized, mostly by increasing the speed of synthetic speech, while the quality of speech must be kept. This issue can be solved by WSOLA technique.

Detail of publication

Title: The possibilities of time scale modification of speech
Author: Méner Martin ; Tihelka Daniel
Language: English
Year: 2009
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Speech Processing
Page: 107 - 113
ISBN: 978-80-86269-18-4
Publisher: Institute of Photonics and Electronics AS CR
Address: Praha
Date: 1 Oct 2009
2011-03-15 16:21:45 / 2011-03-15 16:21:45 / 1

Keywords

WSOLA, text-to-speech, subtitles, ELJABR

BibTeX

@INPROCEEDINGS{MenerMartin_2009_Thepossibilitiesof,
 author = {M\'{e}ner Martin and Tihelka Daniel},
 title = {The possibilities of time scale modification of speech},
 year = {2009},
 publisher = {Institute of Photonics and Electronics AS CR},
 journal = {Speech Processing},
 address = {Praha},
 pages = {107-113},
 ISBN = {978-80-86269-18-4},
 url = {http://www.kky.zcu.cz/en/publications/MenerMartin_2009_Thepossibilitiesof},
}