Title: Use of radial basis function network with discrete wavelet transform for speech enhancement

Authors: Rashmirekha Ram; Mihir Narayan Mohanty

Addresses: Electronics and Communication Engineering, Biomedical and Speech Processing Laboratory, Institute of Technical Education and Research, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, India ' Electronics and Communication Engineering, Biomedical and Speech Processing Laboratory, Institute of Technical Education and Research, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, India

Abstract: Neural network has occupied a very good position in the field of detection, recognition and classification. However the use of these models for signal enhancement is a new direction of research. In this paper, neural network is used to enhance the quality of the speech. The efficient model radial basis function network (RBFN) is chosen for enhancement of the noisy signals. Wavelet Transform is used for decomposition of signal. It works in both the ways. In first stage, the noise from the input signal is reduced. Next to it, these coefficients are used as weights of the RBFN model that makes faster processing as compared to use of random weights. The output of the proposed model is measured in terms of signal to noise ratio (SNR), segmental signal to noise ratio (SegSNR) and perceptual evaluation of speech quality (PESQ). The performance of the proposed method found excellent and is exhibited in the result section.

Keywords: speech enhancement; wavelet transform; radial basis function network; RBFN; signal to noise ratio; SNR; segmental signal to noise ratio; SegSNR; perceptual evaluation of speech quality; PESQ.

DOI: 10.1504/IJCVR.2019.098801

International Journal of Computational Vision and Robotics, 2019 Vol.9 No.2, pp.207 - 223

Received: 15 Feb 2018
Accepted: 11 Jul 2018

Published online: 02 Apr 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article