Low resource speech intent classification using MFCC features.

Rifaza, AF

Low resource speech intent classification using MFCC features.

dc.contributor.advisor	Uthayasanker, T
dc.contributor.author	Rifaza, AF
dc.date.accept	2025
dc.date.accessioned	2026-02-10T05:16:24Z
dc.date.issued	2025
dc.description.abstract	Speech-based user interfaces have revolutionized digital interactions, yet developing them for low-resource languages remains a challenge due to limited labeled speech data. This research proposes a Convolutional Neural Network (CNN)-based approach utilizing Mel-Frequency Cepstral Coefficients (MFCC) along with delta and delta- delta features for effective speech intent classification in Sinhala and Tamil. The methodology incorporates audio preprocessing, MFCC feature extraction, and data augmentation techniques such as noise addition, pitch shifting, and time stretching. A stratified cross-validation framework is used to ensure fair and consistent evaluation. The proposed model achieves 96.92% accuracy on the Sinhala dataset (7,624 samples) and 93.81% on the Tamil dataset (400 samples, ~0.5 hours of speech), representing a substantial improvement over prior methods. These results demonstrate the effectiveness of the CNN-based approach in capturing meaningful acoustic patterns for intent recognition in low-resource settings. The study offers a scalable, efficient solution for speech intent classification and contributes to the advancement of inclusive voice-enabled technologies.
dc.identifier.accno	TH6002
dc.identifier.citation	Rifaza, A.F. (2025). Low resource speech intent classification using MFCC features. [Master’s theses, University of Moratuwa]. Institutional Repository University of Moratuwa. https://dl.lib.uom.lk/handle/123/24829
dc.identifier.department	Department of Computer Science & Engineering
dc.identifier.faculty	Engineering
dc.identifier.uri	https://dl.lib.uom.lk/handle/123/24829
dc.language.iso	en
dc.subject	SPPECH RECOGNITION-Speech Intent Classification
dc.subject	CONVOLUTIONAL NEURAL NETWORKS
dc.subject	HUMAN LANGUAGES-Low-Resource Languages
dc.subject	MACHINE LEARNING-Transfer Learning
dc.subject	SOUND-Mel-Frequency Cepstral Coefficients
dc.subject	COMPUTER SCIENCE-Dissertation
dc.subject	COMPUTER SCIENCE AND ENGINEERING-Dissertation
dc.subject	MSc in Computer Science
dc.title	Low resource speech intent classification using MFCC features.
dc.type	Thesis-Full-text

Files

Original bundle

Now showing 1 - 3 of 3

Name:: TH6002-1.pdf
Size:: 1.11 MB
Format:: Adobe Portable Document Format
Description:: Pre-text

Download

Name:: TH6002-2.pdf
Size:: 152.78 KB
Format:: Adobe Portable Document Format
Description:: Post-text

Download

Name:: TH6002.pdf
Size:: 1.41 MB
Format:: Adobe Portable Document Format
Description:: Full-thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Master of Science in Computer science and Engineering