Clustering sinhala news articles using corpus- based similarity measures

Nanayakkara, P; Ranathunga, S

Clustering sinhala news articles using corpus- based similarity measures

dc.contributor.author	Nanayakkara, P
dc.contributor.author	Ranathunga, S
dc.contributor.editor	Chathuranga, D
dc.date.accessioned	2022-08-22T10:04:42Z
dc.date.available	2022-08-22T10:04:42Z
dc.date.issued	2018-05
dc.description.abstract	News aggregators help readers to handle large numbers of news items in a convenient manner by collecting them into a single place with meaningful groupings. Such news aggregators/clusters are available for English and some other popular languages. However, no such tools are available for Sinhala language. To address this void, this paper presents a system to collect news articles published across the web and group related articles using corpus-based similarity measures. Despite the simplicity of the technique and morphological richness of Sinhala, we achieved very promising results that prove the viability of the presented technique.	en_US
dc.identifier.citation	P. Nanayakkara and S. Ranathunga, "Clustering Sinhala News Articles Using Corpus-Based Similarity Measures," 2018 Moratuwa Engineering Research Conference (MERCon), 2018, pp. 437-442, doi: 10.1109/MERCon.2018.8421890.	en_US
dc.identifier.conference	2018 Moratuwa Engineering Research Conference (MERCon)	en_US
dc.identifier.doi	10.1109/MERCon.2018.8421890	en_US
dc.identifier.email	nanayakkara.purnima@gmail.com	en_US
dc.identifier.email	surangika@cse.mrt.ac.lk	en_US
dc.identifier.faculty	Engineering	en_US
dc.identifier.pgnos	pp. 437-442	en_US
dc.identifier.place	Moratuwa, Sri Lanka	en_US
dc.identifier.proceeding	Proceedings of 2018 Moratuwa Engineering Research Conference (MERCon)	en_US
dc.identifier.uri	http://dl.lib.uom.lk/handle/123/18670
dc.identifier.year	2018	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.relation.uri	https://ieeexplore.ieee.org/document/8421890	en_US
dc.subject	document clustering	en_US
dc.subject	Corpus-based similarity measurement	en_US
dc.subject	Sinhala	en_US
dc.title	Clustering sinhala news articles using corpus- based similarity measures	en_US
dc.type	Conference-Full-text	en_US

Collections

MERCon - 2018

Clustering sinhala news articles using corpus- based similarity measures

Files

Collections