Title: Hindi phoneme-viseme recognition from continuous speech

Authors: A.N. Mishra; Mahesh Chandra; Astik Biswas; S.N. Sharan

Addresses: Department of ECE, Birla Institute of Technology, Mesra, Ranchi, India ' Department of ECE, Birla Institute of Technology, Mesra, Ranchi, India ' Department of ECE, IMS Engineering College, Ghaziabad, Uttar Pradesh, India ' GNIT, Greater Noida, Uttar Pradesh, India

Abstract: Automatic Speech Recognition (ASR) system performs well under restricted conditions but the performance degrades under noisy environment. Audio-visual features play an important role in ASR systems in the presence of noise. In this paper, Hindi phoneme recognition system is designed using audio-visual features. The Discrete Cosine Transform (DCT) features of the lip region integrated with Mel Frequency Cepstral Coefficient (MFCC) audio features are used to get better recognition performance under noisy environments. Colour intensity, hybrid method and Pseudo-Hue methods have been used for lip-localisation approach with Linear Discriminant Analyser (LDA) as a classifier. Recognition performance using Pseudo-Hue method proved best among all the methods.

Keywords: MFCC; mel frequency cepstral coefficient; DCT; discrete cosine transform; LDA; linear discriminant analysis; pseudo-Hue; colour intensity; viseme; Hindi phoneme-viseme recognition; continuous speech; automatic speech recognition; ASR; audio-visual features; noisy environments.

DOI: 10.1504/IJSISE.2013.054793

International Journal of Signal and Imaging Systems Engineering, 2013 Vol.6 No.3, pp.164 - 171

Received: 29 Apr 2011
Accepted: 30 Aug 2011

Published online: 13 Sep 2013 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article