Publications
Detail of publication
Citation
p. 355-361, Springer, Berlin, 2004. : Using the lemmatization technique for phonetic transcription in text-to-speech system . Text, speech and dialogue,
Abstract
This paper deals with a lemmatization technique and its using for phonetic transcription of exceptional words. The lemmatizer is based on language morphology and uses a lexicon of basic word forms and a set of inversion derivation rules to acquire lemmatization rules, which are essential for finding word bases. The lemmatization algorithm and its necessary modifications for transcription of exceptional words are described. The main goal of the designed system is to save computer memory for exceptional lexicon storing. The experimental results showed that it is possible to save from 18.3 % (English) to 98.4 % (Finnish) of the full lexicon size. Hence, the described technique can be applied with advantage for high inflectional and agglutinative languages.
Detail of publication
Title: | Using the lemmatization technique for phonetic transcription in text-to-speech system |
---|---|
Author: | Kanis, J. ; Müller, L. |
Language: | English |
Date of publication: | 8 Sep 2004 |
Year: | 2004 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | Text, speech and dialogue |
Page: | 355 - 361 |
ISBN: | 3-540-23049-1 |
Publisher: | Springer |
Address: | Berlin |
Date: | 8 Sep 2004 - 11 Sep 2004 |
Keywords
lemmatization, phonetic transcription, exceptions from letter to sound conversion
BibTeX
@INPROCEEDINGS{KanisJ_2004_Usingthe, author = {Kanis, J. and M\"{u}ller, L.}, title = {Using the lemmatization technique for phonetic transcription in text-to-speech system}, year = {2004}, publisher = {Springer}, journal = {Text, speech and dialogue}, address = {Berlin}, pages = {355-361}, ISBN = {3-540-23049-1}, url = {http://www.kky.zcu.cz/en/publications/KanisJ_2004_Usingthe}, }