Sinhala speech recognition for interactive voice response systems accessed through mobile phones
Loading...
Date
2018-05
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Abstract
This paper presents the development of a Sinhala
Speech Recognition System to be deployed in an Interactive Voice
Response (IVR) system of a telecommunication service provider.
The main objectives are to recognize Sinhala digits and names
of Sinhala songs to be set up as ringback tones. Sinhala being a
phonetic language, its features are studied to develop a list of 47
phonemes. A continuous speech recognition system is developed
based on Hidden Markov Model (HMM). The acoustic model is
trained using the voice through mobile phone. The outcome is a
speaker independent speech recognition system which is capable
of recognizing 10 digits and 50 Sinhala songs. A word error
rate (WER) of 11.2% using a speech corpus of 0.862 hours and
a sentence error rate (SER) of 5.7% using a speech corpus of
1.388 hours are achieved for digits and songs respectively.
Description
Keywords
Citation
W. Manamperi, D. Karunathilake, T. Madhushani, N. Galagedara and D. Dias, "Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones," 2018 Moratuwa Engineering Research Conference (MERCon), 2018, pp. 241-246, doi: 10.1109/MERCon.2018.8421888.