Detail of publication
Citation
Kanis, J. ; Müller, L. : Using lemmatization technique for automatic diacritics restoration. SPECOM 2005 proceedings, p. 255-258, Moscow State Linguistic University, Moscow, 2005.
Abstract
This paper is devoted to the automatic construction of a lemmatizer from a Full Form - Lemma (FFL) training dictionary, and to the lemmatization of new words that do not appear in the FFL dictionary, i.e. out-of-vocabulary (OOV) words. Three methods for lemmatizing three kinds of OOV words (missing full forms, unknown words, and compound words) are introduced. In addition, the application of automatic lemmatizer construction to the problem of automatic diacritics restoration is described.
Title: | Using lemmatization technique for automatic diacritics restoration |
---|---|
Author: | Kanis, J. ; Müller, L. |
Language: | English |
Date of publication: | 17 Oct 2005 |
Year: | 2005 |
Type of publication: | Papers in proceedings of reviewed conferences |
Title of journal or book: | SPECOM 2005 proceedings |
Page: | 255 - 258 |
ISBN: | 5-7452-0110-X |
Publisher: | Moscow State Linguistic University |
Address: | Moscow |
Date: | 17 Oct 2005 - 19 Oct 2005 |
Keywords
lemmatization, OOV words, diacritics restoration
BibTeX
@INPROCEEDINGS{KanisJ_2005_Usinglemmatization,
  author = {Kanis, J. and M\"{u}ller, L.},
  title = {Using lemmatization technique for automatic diacritics restoration},
  year = {2005},
  publisher = {Moscow State Linguistic University},
  booktitle = {SPECOM 2005 proceedings},
  address = {Moscow},
  pages = {255--258},
  ISBN = {5-7452-0110-X},
  url = {http://www.kky.zcu.cz/en/publications/KanisJ_2005_Usinglemmatization},
}