Title: On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification

Authors: Md. Sahidullah, Sandipan Chakroborty, Goutam Saha

Addresses: Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721 302, India (all three authors)

Abstract: Conventional Speaker Identification (SI) systems utilise spectral features such as Mel-Frequency Cepstral Coefficients (MFCC) or Perceptual Linear Prediction (PLP) as a front-end module. Line Spectral pairs Frequencies (LSF) are a popular alternative representation of Linear Prediction Coefficients (LPC). In this paper, an investigation is carried out to extract LSF from perceptually modified speech. A new feature set extracted from the residual signal is also proposed. An SI system based on this residual feature, which contains information complementary to the spectral characteristics, shows improved performance when fused with the conventional spectral-feature-based system as well as with the proposed perceptually modified LSF.

Keywords: speaker identification; line spectral pairs frequencies; perceptual linear prediction; residual signals; higher order statistics; spectral features; modified speech; feature extraction; biometrics; speaker recognition.

DOI: 10.1504/IJBM.2010.035450

International Journal of Biometrics, 2010 Vol.2 No.4, pp.358 - 378

Published online: 30 Sep 2010