Skip to content

Detail of publication

Citation

Kanis, J. and Müller, L. : Using the lemmatization technique for phonetic transcription in text-to-speech system . Text, speech and dialogue, p. 355-361, Springer, Berlin, 2004.

Abstract

This paper deals with a lemmatization technique and its using for phonetic transcription of exceptional words. The lemmatizer is based on language morphology and uses a lexicon of basic word forms and a set of inversion derivation rules to acquire lemmatization rules, which are essential for finding word bases. The lemmatization algorithm and its necessary modifications for transcription of exceptional words are described. The main goal of the designed system is to save computer memory for exceptional lexicon storing. The experimental results showed that it is possible to save from 18.3 % (English) to 98.4 % (Finnish) of the full lexicon size. Hence, the described technique can be applied with advantage for high inflectional and agglutinative languages.

Detail of publication

Title: Using the lemmatization technique for phonetic transcription in text-to-speech system
Author: Kanis, J. ; Müller, L.
Language: English
Date of publication: 8 Sep 2004
Year: 2004
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Text, speech and dialogue
Page: 355 - 361
ISBN: 3-540-23049-1
Publisher: Springer
Address: Berlin
Date: 8 Sep 2004 - 11 Sep 2004
/ 2008-04-18 14:19:02 /

Keywords

lemmatization, phonetic transcription, exceptions from letter to sound conversion

BibTeX

@INPROCEEDINGS{KanisJ_2004_Usingthe,
 author = {Kanis, J. and M\"{u}ller, L.},
 title = {Using the lemmatization technique for phonetic transcription in text-to-speech system},
 year = {2004},
 publisher = {Springer},
 journal = {Text, speech and dialogue},
 address = {Berlin},
 pages = {355-361},
 ISBN = {3-540-23049-1},
 url = {http://www.kky.zcu.cz/en/publications/KanisJ_2004_Usingthe},
}