dc.contributor.author |
Wijesiri, I |
|
dc.contributor.author |
Gallage, M |
|
dc.contributor.author |
Gunathilaka, B |
|
dc.contributor.author |
Lakjeewa, M |
|
dc.contributor.author |
Wimalasuriya, DC |
|
dc.contributor.author |
Dias, G |
|
dc.contributor.author |
Paranavithana, R |
|
dc.contributor.author |
De Silva, N |
|
dc.date.accessioned |
2017-01-16T04:01:19Z |
|
dc.date.available |
2017-01-16T04:01:19Z |
|
dc.identifier.uri |
http://dl.lib.mrt.ac.lk/handle/123/12222 |
|
dc.description.abstract |
Sinhala is one of the official languages of Sri Lanka and is used by over 19 million people. It belongs to the Indo-Aryan branch of the Indo-European languages and its origins date back to at least 2000 years. It has developed into its current form over a long period of time with influences from a wide variety of languages including Tamil, Portuguese and English. As for any other language, a WordNet is extremely important for Sinhala to take it into the digital era. This paper is based on the project to develop a WordNet for Sinhala based on the English (Princeton) WordNet. It describes how we overcame the challenges in adding Sinhala specific characteristics which were deemed important by Sinhala language experts to the WordNet while keeping the structure of the original English WordNet. It also presents the details of the crowdsourcing system we developed as a part of the project - consisting of a NoSQL database in the backend and a web-based frontend. We conclude by discussing the possibility of adapting this architecture for other languages and the road ahead for the Sinhala WordNet and Sinhala NLP. |
en_US |
dc.relation.uri |
http://www.aclweb.org/anthology/W/W14/W14-0114.pdf |
en_US |
dc.source.uri |
http://www.aclweb.org/anthology/W/W14/W14-0114.pdf |
en_US |
dc.title |
Building a WordNet for Sinhala |
en_US |
dc.type |
Article-Abstract |
en_US |
dc.identifier.year |
2014 |
en_US |
dc.identifier.journal |
Volume editors |
en_US |
dc.identifier.pgnos |
100 |
en_US |
dc.identifier.email |
gihan@uom.lk |
en_US |
dc.identifier.email |
nisansadds@cse.mrt.ac.lk |
en_US |