Speech command recognition using self-supervised asr in Tamil
| dc.contributor.author | Tharmakulasingham, I | |
| dc.contributor.author | Thayasivam, U | |
| dc.contributor.editor | Abeysooriya, R | |
| dc.contributor.editor | Adikariwattage, V | |
| dc.contributor.editor | Hemachandra, K | |
| dc.date.accessioned | 2024-03-21T08:00:06Z | |
| dc.date.available | 2024-03-21T08:00:06Z | |
| dc.date.issued | 2023-12-09 | |
| dc.description.abstract | ASR technology has made significant advancements, approaching performance levels comparable to humans. However, developing ASR models for all languages, especially Low Resource Languages (LRLs), presents challenges due to limited resources. Recent studies on intent classification in LRLs employ transfer learning techniques, leveraging state-of-the-art English ASR models to improve performance in these domains. Additionally, the emerging trend of self-supervised learning has proven advantageous in developing sophisticated ASR models, requiring less labeled data to achieve high performance. In our research, we utilized a self-supervised ASR model to classify intent in an LRL, specifically the Tamil language. We compared two methods for the same ASR framework: a fine-tuning approach and a transfer learning approach with a limited amount of labeled data in Tamil. Our findings indicate that the fine-tuning method outperforms the transfer learning technique. Moreover, the model exhibited a noteworthy increase in accuracy compared to the established phoneme-based speech intent classification methodology in Tamil. This study represents a significant step forward in enhancing speech recognition capabilities for LRLs. | en_US |
| dc.identifier.citation | I. Tharmakulasingham and U. Thayasivam, "Speech Command Recognition Using Self-Supervised ASR in Tamil," 2023 Moratuwa Engineering Research Conference (MERCon), Moratuwa, Sri Lanka, 2023, pp. 137-142, doi: 10.1109/MERCon60487.2023.10355402. | en_US |
| dc.identifier.conference | Moratuwa Engineering Research Conference 2023 | en_US |
| dc.identifier.department | Engineering Research Unit, University of Moratuwa | en_US |
| dc.identifier.email | tharmakulasingham.21@cse.mrt.ac.lk | en_US |
| dc.identifier.email | rtuthaya@cse.mrt.ac.lk | en_US |
| dc.identifier.faculty | Engineering | en_US |
| dc.identifier.pgnos | pp. 137-142 | en_US |
| dc.identifier.place | Katubedda | en_US |
| dc.identifier.proceeding | Proceedings of Moratuwa Engineering Research Conference 2023 | en_US |
| dc.identifier.uri | http://dl.lib.uom.lk/handle/123/22361 | |
| dc.identifier.year | 2023 | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE | en_US |
| dc.relation.uri | https://ieeexplore.ieee.org/document/10355402 | en_US |
| dc.subject | ASR model | en_US |
| dc.subject | Speech intent classification | en_US |
| dc.subject | Low- resource-language | en_US |
| dc.subject | Wav2Vec 2.0 | en_US |
| dc.subject | Tamil | en_US |
| dc.title | Speech command recognition using self-supervised asr in Tamil | en_US |
| dc.type | Conference-Full-text | en_US |
