Speech command recognition using self-supervised asr in Tamil

dc.contributor.authorTharmakulasingham, I
dc.contributor.authorThayasivam, U
dc.contributor.editorAbeysooriya, R
dc.contributor.editorAdikariwattage, V
dc.contributor.editorHemachandra, K
dc.date.accessioned2024-03-21T08:00:06Z
dc.date.available2024-03-21T08:00:06Z
dc.date.issued2023-12-09
dc.description.abstractASR technology has made significant advancements, approaching performance levels comparable to humans. However, developing ASR models for all languages, especially Low Resource Languages (LRLs), presents challenges due to limited resources. Recent studies on intent classification in LRLs employ transfer learning techniques, leveraging state-of-the-art English ASR models to improve performance in these domains. Additionally, the emerging trend of self-supervised learning has proven advantageous in developing sophisticated ASR models, requiring less labeled data to achieve high performance. In our research, we utilized a self-supervised ASR model to classify intent in an LRL, specifically the Tamil language. We compared two methods for the same ASR framework: a fine-tuning approach and a transfer learning approach with a limited amount of labeled data in Tamil. Our findings indicate that the fine-tuning method outperforms the transfer learning technique. Moreover, the model exhibited a noteworthy increase in accuracy compared to the established phoneme-based speech intent classification methodology in Tamil. This study represents a significant step forward in enhancing speech recognition capabilities for LRLs.en_US
dc.identifier.citationI. Tharmakulasingham and U. Thayasivam, "Speech Command Recognition Using Self-Supervised ASR in Tamil," 2023 Moratuwa Engineering Research Conference (MERCon), Moratuwa, Sri Lanka, 2023, pp. 137-142, doi: 10.1109/MERCon60487.2023.10355402.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference 2023en_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.emailtharmakulasingham.21@cse.mrt.ac.lken_US
dc.identifier.emailrtuthaya@cse.mrt.ac.lken_US
dc.identifier.facultyEngineeringen_US
dc.identifier.pgnospp. 137-142en_US
dc.identifier.placeKatubeddaen_US
dc.identifier.proceedingProceedings of Moratuwa Engineering Research Conference 2023en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/22361
dc.identifier.year2023en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urihttps://ieeexplore.ieee.org/document/10355402en_US
dc.subjectASR modelen_US
dc.subjectSpeech intent classificationen_US
dc.subjectLow- resource-languageen_US
dc.subjectWav2Vec 2.0en_US
dc.subjectTamilen_US
dc.titleSpeech command recognition using self-supervised asr in Tamilen_US
dc.typeConference-Full-texten_US

Files

Collections