End To End Model For Speaker Identification With Minimal Training Data

dc.contributor.authorBalakrishnan, S
dc.contributor.authorJathusan, K
dc.contributor.authorThayasivam, U
dc.contributor.editorAdhikariwatte, W
dc.contributor.editorRathnayake, M
dc.contributor.editorHemachandra, K
dc.date.accessioned2022-10-20T03:00:15Z
dc.date.available2022-10-20T03:00:15Z
dc.date.issued2021-07
dc.description.abstractDeep learning has achieved immense universality by outperforming GMM and i-vectors on speaker identification. Neural Network approaches have obtained promising results when fed by raw speech samples directly. Modified Convolutional Neural Network (CNN) architecture called SincNet, based on parameterized sinc functions which offer a very compact way to derive a customized filter bank in the short utterance. This paper proposes attention based Long Short Term Memory (LSTM) architecture that encourages discovering more meaningful speaker-related features with minimal training data. Attention layer built using Neural Networks offers a unique and efficient representation of the speaker characteristics which explore the connection between an aspect and the content of short utterances. The proposed approach converges faster and performs better than the SincNet on the experiments carried out in the speaker identification tasks.en_US
dc.identifier.citationS. Balakrishnan, K. Jathusan and U. Thayasivam, "End To End Model For Speaker Identification With Minimal Training Data," 2021 Moratuwa Engineering Research Conference (MERCon), 2021, pp. 456-461, doi: 10.1109/MERCon52712.2021.9525740.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference 2021en_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.doi10.1109/MERCon52712.2021.9525740en_US
dc.identifier.facultyEngineeringen_US
dc.identifier.pgnospp. 456-461en_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of Moratuwa Engineering Research Conference 2021en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19152
dc.identifier.year2021en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urihttps://ieeexplore.ieee.org/document/9525740en_US
dc.subjectSpeaker recognitionen_US
dc.subjectNeural networksen_US
dc.subjectAttention layeren_US
dc.titleEnd To End Model For Speaker Identification With Minimal Training Dataen_US
dc.typeConference-Full-texten_US

Files

Collections