Title: Hindi phoneme-viseme recognition from continuous speech
Authors: A.N. Mishra; Mahesh Chandra; Astik Biswas; S.N. Sharan
Addresses: Department of ECE, Birla Institute of Technology, Mesra, Ranchi, India ' Department of ECE, Birla Institute of Technology, Mesra, Ranchi, India ' Department of ECE, IMS Engineering College, Ghaziabad, Uttar Pradesh, India ' GNIT, Greater Noida, Uttar Pradesh, India
Abstract: Automatic Speech Recognition (ASR) system performs well under restricted conditions but the performance degrades under noisy environment. Audio-visual features play an important role in ASR systems in the presence of noise. In this paper, Hindi phoneme recognition system is designed using audio-visual features. The Discrete Cosine Transform (DCT) features of the lip region integrated with Mel Frequency Cepstral Coefficient (MFCC) audio features are used to get better recognition performance under noisy environments. Colour intensity, hybrid method and Pseudo-Hue methods have been used for lip-localisation approach with Linear Discriminant Analyser (LDA) as a classifier. Recognition performance using Pseudo-Hue method proved best among all the methods.
Keywords: MFCC; mel frequency cepstral coefficient; DCT; discrete cosine transform; LDA; linear discriminant analysis; pseudo-Hue; colour intensity; viseme; Hindi phoneme-viseme recognition; continuous speech; automatic speech recognition; ASR; audio-visual features; noisy environments.
DOI: 10.1504/IJSISE.2013.054793
International Journal of Signal and Imaging Systems Engineering, 2013 Vol.6 No.3, pp.164 - 171
Received: 29 Apr 2011
Accepted: 30 Aug 2011
Published online: 13 Sep 2013 *