Sinhala Short Sentence Similarity Measures using Corpus-Based Similarity for Short Answer Grading

Kadupitiya, JCS; Ranathunga, S; Dias, G

Sinhala Short Sentence Similarity Measures using Corpus-Based Similarity for Short Answer Grading

Authors

Kadupitiya, JCS

Ranathunga, S

Dias, G

Abstract

Currently, corpus based-similarity, string-based similarity, and knowledge-based similarity techniques are used to compare short phrases. However, no work has been conducted on the similarity of phrases in Sinhala language. In this paper, we present a hybrid methodology to compute the similarity between two Sinhala sentences using a Semantic Similarity Measurement technique (corpus-based similarity measurement plus knowledge-based similarity measurement) that makes use of word order information. Since Sinhala WordNet is still under construction, we used lexical resources in performing this semantic similarity calculation. Evaluation using 4000 sentence pairs yielded an average MSE of 0.145 and a Pearson correlation factor of 0.832.

URI

http://dl.lib.mrt.ac.lk/handle/123/12253

Collections

Articles authored by UoM staff

Full item page

Sinhala Short Sentence Similarity Measures using Corpus-Based Similarity for Short Answer Grading

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

DOI

Collections

Endorsement

Review

Supplemented By

Referenced By