Sinmorphy: a morphological analyzer for the sinhala language
dc.contributor.author | Kumarasinghe, K | |
dc.contributor.author | Dias, G | |
dc.contributor.author | Herath, I | |
dc.contributor.editor | Adhikariwatte, W | |
dc.contributor.editor | Rathnayake, M | |
dc.contributor.editor | Hemachandra, K | |
dc.date.accessioned | 2022-10-17T05:54:40Z | |
dc.date.available | 2022-10-17T05:54:40Z | |
dc.date.issued | 2021-07 | |
dc.description.abstract | A morphological analyzer analyses a word into its lemma and a set of morphosyntactic tags. It is a crucial tool for natural language processing-related tasks especially in morphologically rich languages such as Sinhala. We present SinMorphy, the first comprehensive morphological analyser and synthesizer for the Sinhala language. SinMorphy is a rule-based system with a comprehensive vocabulary of Sinhala words. Therefore, it accurately handles a great majority of contemporary Sinhala text. It also synthesizes the lexical form of a word given a lemma and a set of tags. The system is based on a finite-state transducer and is written in the Foma and Lexc languages. It handles all types of words including nouns, verbs (including compound nouns and verbs), adjectives, adverbs, and particles. It also includes a guesser to analyze out-of-vocabulary words. It correctly analyses 81.3% of the most common 20,000 Sinhala words and 85.2% of a random test set of 1000 words. | en_US |
dc.identifier.citation | K. Kumarasinghe, G. Dias and I. Herath, "SinMorphy: A Morphological Analyzer for the Sinhala Language," 2021 Moratuwa Engineering Research Conference (MERCon), 2021, pp. 681-686, doi: 10.1109/MERCon52712.2021.9525636. | en_US |
dc.identifier.conference | Moratuwa Engineering Research Conference 2021 | en_US |
dc.identifier.department | Engineering Research Unit, University of Moratuwa | en_US |
dc.identifier.doi | 10.1109/MERCon52712.2021.9525636 | en_US |
dc.identifier.faculty | Engineering | en_US |
dc.identifier.pgnos | pp. 681-686 | |
dc.identifier.place | Moratuwa, Sri Lanka | en_US |
dc.identifier.proceeding | Proceedings of Moratuwa Engineering Research Conference 2021 | en_US |
dc.identifier.uri | http://dl.lib.uom.lk/handle/123/19113 | |
dc.identifier.year | 2021 | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.uri | https://ieeexplore.ieee.org/document/9525636 | en_US |
dc.subject | Sinhala | en_US |
dc.subject | morphology | en_US |
dc.subject | morphological analyzer | en_US |
dc.subject | parser | en_US |
dc.subject | synthesizer | en_US |
dc.subject | natural language | en_US |
dc.subject | syntax | en_US |
dc.subject | tagger | en_US |
dc.title | Sinmorphy: a morphological analyzer for the sinhala language | en_US |
dc.type | Conference-Full-text | en_US |