Title: A new GA optimised Reliability Ratio based integration weight estimation scheme for decision fusion Audio-Visual Speech Recognition

Authors: R. Rajavel, P.S. Sathidevi

Addresses: National Institute of Technology Calicut, Department of Electronics and Communication Engineering, Calicut 673601, India. ' National Institute of Technology Calicut, Department of Electronics and Communication Engineering, Calicut 673601, India

Abstract: Audio-Visual Speech Recognition (AVSR) using acoustic and visual signals of speech have received attention recently because of its robustness in noisy environments. An important issue in decision fusion AVSR systems is the determination of appropriate integration weight for better performance. A new Genetic Algorithm (GA) based scheme to obtain an appropriate integration weight is proposed here. The performance of the proposed scheme is demonstrated for commonly used mobile functions isolated word recognition via multi-speaker database experiment. The results show that the proposed scheme improves robust recognition accuracy over conventional unimodal systems and other related bimodal systems, namely, Reliability ratio and Neural Network based AVSR systems.

Keywords: AVSR; audio-visual speech recognition; side face feature extraction; visual feature extraction; audio visual decision fusion; reliability ratio; weight optimisation; late integration; genetic algorithms.

DOI: 10.1504/IJSISE.2011.041605

International Journal of Signal and Imaging Systems Engineering, 2011 Vol.4 No.2, pp.123 - 131

Published online: 13 Mar 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article