dc.contributor.author |
Kumarasinghe, K |
|
dc.contributor.author |
Dias, G |
|
dc.contributor.author |
Herath, I |
|
dc.contributor.editor |
Adhikariwatte, W |
|
dc.contributor.editor |
Rathnayake, M |
|
dc.contributor.editor |
Hemachandra, K |
|
dc.date.accessioned |
2022-10-17T05:54:40Z |
|
dc.date.available |
2022-10-17T05:54:40Z |
|
dc.date.issued |
2021-07 |
|
dc.identifier.citation |
K. Kumarasinghe, G. Dias and I. Herath, "SinMorphy: A Morphological Analyzer for the Sinhala Language," 2021 Moratuwa Engineering Research Conference (MERCon), 2021, pp. 681-686, doi: 10.1109/MERCon52712.2021.9525636. |
en_US |
dc.identifier.uri |
http://dl.lib.uom.lk/handle/123/19113 |
|
dc.description.abstract |
A morphological analyzer analyzes a word into its lemma and a set of morphosyntactic tags. It is a crucial tool for natural language processing tasks, especially in morphologically rich languages such as Sinhala. We present SinMorphy, the first comprehensive morphological analyzer and synthesizer for the Sinhala language. SinMorphy is a rule-based system with a comprehensive vocabulary of Sinhala words, so it accurately handles the great majority of contemporary Sinhala text. It also synthesizes the lexical form of a word given a lemma and a set of tags. The system is based on a finite-state transducer and is written in the Foma and Lexc languages. It handles all types of words, including nouns and verbs (with their compound forms), adjectives, adverbs, and particles. It also includes a guesser to analyze out-of-vocabulary words. It correctly analyzes 81.3% of the 20,000 most common Sinhala words and 85.2% of a random test set of 1,000 words. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
IEEE |
en_US |
dc.relation.uri |
https://ieeexplore.ieee.org/document/9525636 |
en_US |
dc.subject |
Sinhala |
en_US |
dc.subject |
morphology |
en_US |
dc.subject |
morphological analyzer |
en_US |
dc.subject |
parser |
en_US |
dc.subject |
synthesizer |
en_US |
dc.subject |
natural language |
en_US |
dc.subject |
syntax |
en_US |
dc.subject |
tagger |
en_US |
dc.title |
SinMorphy: a morphological analyzer for the Sinhala language |
en_US |
dc.type |
Conference-Full-text |
en_US |
dc.identifier.faculty |
Engineering |
en_US |
dc.identifier.department |
Engineering Research Unit, University of Moratuwa |
en_US |
dc.identifier.year |
2021 |
en_US |
dc.identifier.conference |
Moratuwa Engineering Research Conference 2021 |
en_US |
dc.identifier.place |
Moratuwa, Sri Lanka |
en_US |
dc.identifier.pgnos |
pp. 681-686 |
|
dc.identifier.proceeding |
Proceedings of Moratuwa Engineering Research Conference 2021 |
en_US |
dc.identifier.doi |
10.1109/MERCon52712.2021.9525636 |
en_US |