Ananya - a named-entity-recognition (ner) system for sinhala language
dc.contributor.author | Manamini, SAPM | |
dc.contributor.author | Ahamed, AF | |
dc.contributor.author | Rajapakshe, RAEC | |
dc.contributor.author | Reemal, GHA | |
dc.contributor.author | Jayasena, S | |
dc.contributor.author | Dias, GV | |
dc.contributor.author | Ranathunga, S | |
dc.contributor.editor | Jayasekara, AGBP | |
dc.contributor.editor | Bandara, HMND | |
dc.contributor.editor | Amarasinghe, YWR | |
dc.date.accessioned | 2022-09-08T09:16:35Z | |
dc.date.available | 2022-09-08T09:16:35Z | |
dc.date.issued | 2016-04 | |
dc.description.abstract | Named-Entity-Recognition (NER) is one of the major tasks under Natural Language Processing, which is widely used in the fields of Computer Science and Computational Linguistics. However, the amount of prior research done on NER for Sinhala is very minimal. In this paper, we present data-driven techniques to detect Named Entities in Sinhala text, with the use of Conditional Random Fields (CRF) and Maximum Entropy (ME) statistical modeling methods. Results obtained from experiments indicate that CRF, which provided the highest accuracy for the same task for other languages outperforms ME in Sinhala NER as well. Furthermore, we identify different linguistic features such as orthographic word level and contextual information that are effective with both CRF and ME Algorithms. | en_US |
dc.identifier.citation | S. A. P. M. Manamini et al., "Ananya - a Named-Entity-Recognition (NER) system for Sinhala language," 2016 Moratuwa Engineering Research Conference (MERCon), 2016, pp. 30-35, doi: 10.1109/MERCon.2016.7480111. | en_US |
dc.identifier.conference | 2016 Moratuwa Engineering Research Conference (MERCon) | en_US |
dc.identifier.department | Engineering Research Unit, University of Moratuwa | en_US |
dc.identifier.doi | 10.1109/MERCon.2016.7480111 | en_US |
dc.identifier.email | prabushi.11@cse.mrt.ac.lk | en_US |
dc.identifier.email | eranda.11@cse.mrt.ac.lk | en_US |
dc.identifier.email | achintha.11@cse.mrt.ac.lk | en_US |
dc.identifier.email | farazath.11@cse.mrt.ac.lk | en_US |
dc.identifier.email | sanath@cse.mrt.ac.lk | en_US |
dc.identifier.email | gihan@cse.mrt.ac.lk | en_US |
dc.identifier.email | surangika@cse.mrt.ac.lk | en_US |
dc.identifier.faculty | Engineering | en_US |
dc.identifier.pgnos | pp. 30-35 | en_US |
dc.identifier.place | Moratuwa, Sri Lanka | en_US |
dc.identifier.proceeding | Proceedings of 2016 Moratuwa Engineering Research Conference (MERCon) | en_US |
dc.identifier.uri | http://dl.lib.uom.lk/handle/123/18992 | |
dc.identifier.year | 2016 | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.uri | https://ieeexplore.ieee.org/document/7480111 | en_US |
dc.subject | Named Entity Recognition | en_US |
dc.subject | Conditional Random Fields | en_US |
dc.subject | Maximum Entropy Model | en_US |
dc.subject | Sinhala language | en_US |
dc.subject | Natural Language Processing | en_US |
dc.title | Ananya - a named-entity-recognition (ner) system for sinhala language | en_US |
dc.type | Conference-Full-text | en_US |