Clustering sinhala news articles using corpus- based similarity measures
dc.contributor.author | Nanayakkara, P | |
dc.contributor.author | Ranathunga, S | |
dc.contributor.editor | Chathuranga, D | |
dc.date.accessioned | 2022-08-22T10:04:42Z | |
dc.date.available | 2022-08-22T10:04:42Z | |
dc.date.issued | 2018-05 | |
dc.description.abstract | News aggregators help readers to handle large numbers of news items in a convenient manner by collecting them into a single place with meaningful groupings. Such news aggregators/clusters are available for English and some other popular languages. However, no such tools are available for Sinhala language. To address this void, this paper presents a system to collect news articles published across the web and group related articles using corpus-based similarity measures. Despite the simplicity of the technique and morphological richness of Sinhala, we achieved very promising results that prove the viability of the presented technique. | en_US |
dc.identifier.citation | P. Nanayakkara and S. Ranathunga, "Clustering Sinhala News Articles Using Corpus-Based Similarity Measures," 2018 Moratuwa Engineering Research Conference (MERCon), 2018, pp. 437-442, doi: 10.1109/MERCon.2018.8421890. | en_US |
dc.identifier.conference | 2018 Moratuwa Engineering Research Conference (MERCon) | en_US |
dc.identifier.doi | 10.1109/MERCon.2018.8421890 | en_US |
dc.identifier.email | nanayakkara.purnima@gmail.com | en_US |
dc.identifier.email | surangika@cse.mrt.ac.lk | en_US |
dc.identifier.faculty | Engineering | en_US |
dc.identifier.pgnos | pp. 437-442 | en_US |
dc.identifier.place | Moratuwa, Sri Lanka | en_US |
dc.identifier.proceeding | Proceedings of 2018 Moratuwa Engineering Research Conference (MERCon) | en_US |
dc.identifier.uri | http://dl.lib.uom.lk/handle/123/18670 | |
dc.identifier.year | 2018 | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.uri | https://ieeexplore.ieee.org/document/8421890 | en_US |
dc.subject | document clustering | en_US |
dc.subject | Corpus-based similarity measurement | en_US |
dc.subject | Sinhala | en_US |
dc.title | Clustering sinhala news articles using corpus- based similarity measures | en_US |
dc.type | Conference-Full-text | en_US |