Skip to content

Detail of publication

Citation

Kanis, J. and Müller, L. : Using lemmatization technique for automatic diacritics restoration . SPECOM 2005 proceedings, p. 255-258, Moscow State Linguistic University, Moscow , 2005.

Download PDF

PDF

Abstract

This paper is devoted to automatic construction of a lemmatizer from a Full Form - Lemma (FFL) training dictionary, and to lemmatization of new, in the FFL dictionary unseen - i.e. out-of-vocabulary (OOV), words. Three methods of lemmatization of three kinds of OOV words (missing full forms, unknown words, and compound words) are introduced. In addition, the application of lemmatizer automatic construction to the problem of automatic diacritics restoration is described.

Detail of publication

Title: Using lemmatization technique for automatic diacritics restoration
Author: Kanis, J. ; Müller, L.
Language: English
Date of publication: 17 Oct 2005
Year: 2005
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: SPECOM 2005 proceedings
Page: 255 - 258
ISBN: 5-7452-0110-X
Publisher: Moscow State Linguistic University
Address: Moscow
Date: 17 Oct 2005 - 19 Oct 2005
/ 2008-04-18 14:31:38 /

Keywords

lemmatization, OOV words, diacritics restoration

BibTeX

@INPROCEEDINGS{KanisJ_2005_Usinglemmatization,
 author = {Kanis, J. and M\"{u}ller, L.},
 title = {Using lemmatization technique for automatic diacritics restoration},
 year = {2005},
 publisher = {Moscow State Linguistic University},
 journal = {SPECOM 2005 proceedings},
 address = {Moscow },
 pages = {255-258},
 ISBN = {5-7452-0110-X},
 url = {http://www.kky.zcu.cz/en/publications/KanisJ_2005_Usinglemmatization},
}