Enhanced time delay neural network architectures for sinhala speech recognition

dc.contributor.authorWarusawithana, D
dc.contributor.authorKulaweera, N
dc.contributor.authorWeerasinghe, L
dc.contributor.authorKarunarathne, B
dc.contributor.editorRathnayake, M
dc.contributor.editorAdhikariwatte, V
dc.contributor.editorHemachandra, K
dc.date.accessioned2022-10-27T08:47:15Z
dc.date.available2022-10-27T08:47:15Z
dc.date.issued2022-07
dc.description.abstractAutomatic Speech Recognition (ASR) has become a fast-growing research domain due to advancements in Machine Learning. In addition to the development of large training corpora, the introduction of novel architectures for ASR models has contributed to defining new boundaries for the performance of speech recognition systems. However, there is a significant difference in speech recognition accuracy between major world languages and low-resourced languages such as Sinhala, due to inadequate research. We have applied enhanced time-delay neural network architectures for acoustic modeling in Sinhala ASR, including the Multistream CNN architecture. Using the Kaldi ASR Toolkit, we have trained ASR models with a publicly available corpus of over 200 hours of speech data. The results show a remarkable improvement in the accuracy of Sinhala speech recognition as demonstrated by a reduction in the Word-Error-Rate (WER) to 25.12%.en_US
dc.identifier.citationD. Warusawithana, N. Kulaweera, L. Weerasinghe and B. Karunarathne, "Enhanced Time Delay Neural Network Architectures for Sinhala Speech Recognition," 2022 Moratuwa Engineering Research Conference (MERCon), 2022, pp. 1-6, doi: 10.1109/MERCon55799.2022.9906216.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference 2022en_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.doi10.1109/MERCon55799.2022.9906216en_US
dc.identifier.emaildisurawaru.17@cse.mrt.ac.lk
dc.identifier.emailrukshilakulaweera.17@cse.mrt.ac.lk
dc.identifier.emaillakshan.17@cse.mrt.ac.lk
dc.identifier.emailbuddhika@cse.mrt.ac.lk
dc.identifier.facultyEngineeringen_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of Moratuwa Engineering Research Conference 2022en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19272
dc.identifier.year2022en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urihttps://ieeexplore.ieee.org/document/9906216en_US
dc.subjectSinhalaen_US
dc.subjectASRen_US
dc.subjectfactored TDNNen_US
dc.subjectMultistream CNNen_US
dc.titleEnhanced time delay neural network architectures for sinhala speech recognitionen_US
dc.typeConference-Full-texten_US

Files

Collections