SinMorphy: a morphological analyzer for the Sinhala language

dc.contributor.author: Kumarasinghe, K
dc.contributor.author: Dias, G
dc.contributor.author: Herath, I
dc.contributor.editor: Adhikariwatte, W
dc.contributor.editor: Rathnayake, M
dc.contributor.editor: Hemachandra, K
dc.date.accessioned: 2022-10-17T05:54:40Z
dc.date.available: 2022-10-17T05:54:40Z
dc.date.issued: 2021-07
dc.description.abstract: A morphological analyzer analyzes a word into its lemma and a set of morphosyntactic tags. It is a crucial tool for natural language processing tasks, especially in morphologically rich languages such as Sinhala. We present SinMorphy, the first comprehensive morphological analyzer and synthesizer for the Sinhala language. SinMorphy is a rule-based system with a comprehensive vocabulary of Sinhala words. Therefore, it accurately handles a great majority of contemporary Sinhala text. It also synthesizes the lexical form of a word given a lemma and a set of tags. The system is based on a finite-state transducer and is written in the Foma and Lexc languages. It handles all types of words including nouns, verbs (including compound nouns and verbs), adjectives, adverbs, and particles. It also includes a guesser to analyze out-of-vocabulary words. It correctly analyzes 81.3% of the most common 20,000 Sinhala words and 85.2% of a random test set of 1,000 words. [en_US]
dc.identifier.citation: K. Kumarasinghe, G. Dias and I. Herath, "SinMorphy: A Morphological Analyzer for the Sinhala Language," 2021 Moratuwa Engineering Research Conference (MERCon), 2021, pp. 681-686, doi: 10.1109/MERCon52712.2021.9525636. [en_US]
dc.identifier.conference: Moratuwa Engineering Research Conference 2021 [en_US]
dc.identifier.department: Engineering Research Unit, University of Moratuwa [en_US]
dc.identifier.doi: 10.1109/MERCon52712.2021.9525636 [en_US]
dc.identifier.faculty: Engineering [en_US]
dc.identifier.pgnos: pp. 681-686
dc.identifier.place: Moratuwa, Sri Lanka [en_US]
dc.identifier.proceeding: Proceedings of Moratuwa Engineering Research Conference 2021 [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/19113
dc.identifier.year: 2021 [en_US]
dc.language.iso: en [en_US]
dc.publisher: IEEE [en_US]
dc.relation.uri: https://ieeexplore.ieee.org/document/9525636 [en_US]
dc.subject: Sinhala [en_US]
dc.subject: morphology [en_US]
dc.subject: morphological analyzer [en_US]
dc.subject: parser [en_US]
dc.subject: synthesizer [en_US]
dc.subject: natural language [en_US]
dc.subject: syntax [en_US]
dc.subject: tagger [en_US]
dc.title: SinMorphy: a morphological analyzer for the Sinhala language [en_US]
dc.type: Conference-Full-text [en_US]
