Institutional-Repository, University of Moratuwa.  

Clustering sinhala news articles using corpus- based similarity measures

Show simple item record

dc.contributor.author Nanayakkara, P
dc.contributor.author Ranathunga, S
dc.contributor.editor Chathuranga, D
dc.date.accessioned 2022-08-22T10:04:42Z
dc.date.available 2022-08-22T10:04:42Z
dc.date.issued 2018-05
dc.identifier.citation P. Nanayakkara and S. Ranathunga, "Clustering Sinhala News Articles Using Corpus-Based Similarity Measures," 2018 Moratuwa Engineering Research Conference (MERCon), 2018, pp. 437-442, doi: 10.1109/MERCon.2018.8421890. en_US
dc.identifier.uri http://dl.lib.uom.lk/handle/123/18670
dc.description.abstract News aggregators help readers to handle large numbers of news items in a convenient manner by collecting them into a single place with meaningful groupings. Such news aggregators/clusters are available for English and some other popular languages. However, no such tools are available for Sinhala language. To address this void, this paper presents a system to collect news articles published across the web and group related articles using corpus-based similarity measures. Despite the simplicity of the technique and morphological richness of Sinhala, we achieved very promising results that prove the viability of the presented technique. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.uri https://ieeexplore.ieee.org/document/8421890 en_US
dc.subject document clustering en_US
dc.subject Corpus-based similarity measurement en_US
dc.subject Sinhala en_US
dc.title Clustering sinhala news articles using corpus- based similarity measures en_US
dc.type Conference-Full-text en_US
dc.identifier.faculty Engineering en_US
dc.identifier.year 2018 en_US
dc.identifier.conference 2018 Moratuwa Engineering Research Conference (MERCon) en_US
dc.identifier.place Moratuwa, Sri Lanka en_US
dc.identifier.pgnos pp. 437-442 en_US
dc.identifier.proceeding Proceedings of 2018 Moratuwa Engineering Research Conference (MERCon) en_US
dc.identifier.email nanayakkara.purnima@gmail.com en_US
dc.identifier.email surangika@cse.mrt.ac.lk en_US
dc.identifier.doi 10.1109/MERCon.2018.8421890 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record