Sinhala Short Sentence Similarity Measures using Corpus-Based Similarity for Short Answer Grading

Kadupitiya, JCS; Ranathunga, S; Dias, G

dc.contributor.author	Kadupitiya, JCS
dc.contributor.author	Ranathunga, S
dc.contributor.author	Dias, G
dc.date.accessioned	2017-01-17T10:06:24Z
dc.date.available	2017-01-17T10:06:24Z
dc.identifier.uri	http://dl.lib.mrt.ac.lk/handle/123/12253
dc.description.abstract	Currently, corpus-based similarity, string-based similarity, and knowledge-based similarity techniques are used to compare short phrases. However, no work has been conducted on the similarity of phrases in the Sinhala language. In this paper, we present a hybrid methodology to compute the similarity between two Sinhala sentences using a Semantic Similarity Measurement technique (corpus-based similarity measurement plus knowledge-based similarity measurement) that makes use of word order information. Since the Sinhala WordNet is still under construction, we used lexical resources in performing this semantic similarity calculation. Evaluation using 4000 sentence pairs yielded an average MSE of 0.145 and a Pearson correlation factor of 0.832.	en_US
dc.relation.uri	http://www.aclweb.org/anthology/W/W16/W16-37.pdf#page=56	en_US
dc.source.uri	http://www.aclweb.org/anthology/W/W16/W16-37.pdf#page=56	en_US
dc.title	Sinhala Short Sentence Similarity Measures using Corpus-Based Similarity for Short Answer Grading	en_US
dc.type	Article-Abstract	en_US
dc.identifier.year	2016	en_US
dc.identifier.journal	WSSANLP	en_US
dc.identifier.pgnos	44	en_US
dc.identifier.email	gihan@uom.lk	en_US
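
The abstract reports two evaluation figures, a mean squared error (MSE) and a Pearson correlation, computed between system similarity scores and reference scores. The record does not describe how the authors computed them, so the following is only a minimal sketch of the standard definitions of both metrics. The function names and the sample scores in the usage note are hypothetical and are not taken from the paper.

```python
import math


def mean_squared_error(predicted, reference):
    """Average of squared differences between two equal-length score lists."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("score lists must be non-empty and of equal length")
    return sum((p - r) ** 2 for p, r in zip(predicted, reference)) / len(predicted)


def pearson_correlation(x, y):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("score lists must have equal length of at least 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations (unnormalised covariance).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations (unnormalised variances).
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)
```

As a usage illustration with made-up values, calling `mean_squared_error(system_scores, human_scores)` and `pearson_correlation(system_scores, human_scores)` on two lists of per-pair similarity scores would give one MSE and one correlation for that set of pairs. A lower MSE and a correlation closer to 1 both indicate closer agreement with the reference scores.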