Statistical Machine Translation System for Sinhala and Tamil Languages

dc.contributor.authorRajpirathap, S
dc.contributor.authorSheeyam, S
dc.contributor.authorUmasuthan, K
dc.contributor.authorChelvarajah, A
dc.date.accessioned2017-04-03T03:44:23Z
dc.date.available2017-04-03T03:44:23Z
dc.description.abstractStatistical machine translation method is one of the most promising and efficient method to perform machine translation for Sri Lankan languages likes Sinhala and Tamil. Statistical approach is more suitable for structurally dissimilar pairs of languages and efficient solution for large text translation. In Sri Lanka we have a rising need for translation for Sinhala and Tamil and the statistical machine translation approach is more suitable for the concerned languages. Sinhala and Tamil have a similarity in grammar and statistical approach will help to obtain more accurate results. We have developed a bi-directional translation system for both Tamil to Sinhala and Sinhala to Tamil for this research. We have used the Sri Lankan parliament corpus to train the language model. We have critically evaluated the both systems with parameter optimizations and have obtained the most accurate and efficient system. We have also utilized the scoring techniques like BLEU [2, 8] & NIST [2] for the system evaluation and we have integrated the MERT technique to tune the decoder.en_US
dc.identifier.conferenceITRU RESEARCH SYMPOSIUMen_US
dc.identifier.departmentDepartment of Information Technologyen_US
dc.identifier.emailnova-fit10@googlegroups.comen_US
dc.identifier.emailamalraj@uom.lken_US
dc.identifier.facultyITen_US
dc.identifier.pgnos38-43en_US
dc.identifier.placeUNIVERSITY OF MORATUWAen_US
dc.identifier.urihttp://dl.lib.mrt.ac.lk/handle/123/12629
dc.identifier.year2014en_US
dc.language.isoenen_US
dc.titleStatistical Machine Translation System for Sinhala and Tamil Languagesen_US
dc.typeConference-Full-texten_US

Files

Collections