Speech to capability mapping framework for Sinhala language (Abstract)

dc.contributor.authorRanathunga, RADS
dc.contributor.authorThayasivam, U
dc.date.accessioned2025-07-21T07:12:28Z
dc.date.issued2018
dc.description.abstractMatching voice to a predefined set of capabilities is a key requirement in many applications domains such as, robotics, automation, personal devices, building management, automobile and elderly & differently abled assistant systems. The approach is to convert speech to text using Automatic Speech Recognition (ASR) and match the text to the list of predefined capabilities. Initial attempts for speech recognition made use of dynamic time warping (DTW) algorithms. Later research made use of Hidden Markov Model (HMM) based approaches that were more accurate than DTW based approaches. Recently ‘deep learning’ has shown very high quality human like performance in converting speech to text. There have been few prior research efforts towards a Sinhala ASR [1,2]. However, they have not been able to gain wider applicability and higher accuracy. This research built a Sinhala, as well as a Tamil speech dataset and used it to implement a speech to intent classification model. First version was built using Deep Learning techniques from the scratch. The second used transfer learning techniques. In transfer learning, we utilized a model trained for the English language with a large amount of English speech, and further finetuned it with the limited amount of Sinhala/Tamil speech data. This was the first research to experiment with transfer learning for speech intent classification
dc.identifier.accnoSRC200
dc.identifier.srgnoSRC/LT/2018/10
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/23908
dc.language.isoen
dc.subjectSENATE RESEARCH COMMITTEE – Research Report
dc.subjectSINHALA SPEECH RECOGNIZER
dc.subjectAUTOMATIC SPEECH RECOGNITION
dc.subjectCAPABILITY MAPPING
dc.subjectSINHALA LANGUAGE
dc.titleSpeech to capability mapping framework for Sinhala language (Abstract)
dc.typeSRC-Report

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SRC200 - Dr. RADS Ranathunga SRCLT201810 Clsng.pdf
Size:
986.83 KB
Format:
Adobe Portable Document Format
Description:
SRC Report

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: