Title: Robustness evaluation of wavelet based features for continuous speech recognition

Authors: Omar Farooq, Sekharjit Datta

Addresses: Department of Electronics Engineering, ZH College of Engineering and Technology, Aligarh Muslim University, Aligarh, UP 202 002, India. ' Department of Electronic and Electrical Engineering, Loughborough University, Loughborough, LE11 3TU, UK

Abstract: This article evaluates robustness of admissible wavelet packet based features for continuous speech recognition. The recognition accuracy is compared with the standard mel frequency cepstral coefficients (MFCC) under clean and noisy environment. This is carried out by adding white Gaussian and speech noise to the phonemes of the TIMIT database to generate different levels of signal to noise ratio. Further, a wavelet based denoising technique is proposed as a front-end for noise reduction. Soft and hard thresholding techniques are used with one-level and two-level wavelet based denoising. The speech recogniser with Continuous Density Hidden Markov|s Model is used to model the phonemes for the word recognition task. The recognition performance achieved with denoising of the input speech shows improvement as compared to without denoising of both the MFCC and the wavelet based features.

Keywords: ASR; automatic speech recognition; feature extraction; continuous density HMM; hidden Markov models; wavelet denoising; wavelet transforms; robustness; admissible wavelet packets; phonemes; word recognition.

DOI: 10.1504/IJISTA.2010.033893

International Journal of Intelligent Systems Technologies and Applications, 2010 Vol.9 No.1, pp.1 - 14

Received: 05 Feb 2008
Accepted: 19 Nov 2008

Published online: 03 Jul 2010 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article