Title: A Hindi speech recognition system for connected words using HTK

Authors: Kuldeep Kumar; R.K. Aggarwal; Ankita Jain

Addresses: Department of Computer Engineering, National Institute of Technology, Kurukshetra-136119, Haryana, India. ' Department of Computer Engineering, National Institute of Technology, Kurukshetra-136119, Haryana, India. ' Department of Electronics and Communication Engineering, National Institute of Technology, Kurukshetra-136119, Haryana, India

Abstract: A speech recognition system converts the speech-sound into the corresponding text. The uttered speech is first understood by the machine and then the corresponding text is displayed. This paper aims to build a connected-words speech recognition system for Hindi language. The system has been developed using hidden Markov model toolkit (HTK) that uses hidden Markov models (HMMs) for recognition. The system has been trained to recognise any sequence of words selected from the vocabulary of 102 words. Initially, Mel frequency cepstral coefficients (MFCCs) have been used to extract the features from the speech-files. Then, the system has been trained to estimate the HMM parameters using word level acoustic models. The training data has been collected from 12 speakers including both males and females. The test-data used for evaluating the system-performance has been collected from the five speakers. The experiments have also been performed on the system. The experimental results show that the presented system provides the overall word-accuracy of 87.01%, word-error rate of 12.99%, and word-correction rate of 90.93% respectively. The work has been evaluated by performing the comparative analysis with the existing similar works and the betterment has been reported.

Keywords: hidden Markov models; HMMs; automatic speech recognition; ASR; hidden Markov model toolkit; HTK; Hindi; connected words; Mel frequency cepstral coefficient; MFCC.

DOI: 10.1504/IJCSYSE.2012.044740

International Journal of Computational Systems Engineering, 2012 Vol.1 No.1, pp.25 - 32

Published online: 23 Aug 2014 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article