Title: Building acoustic model for phoneme recognition using PSO-DBN

Authors: B.R. Laxmi Sree; M.S. Vijaya

Addresses: PSGR Krishnammal College for Women, Avinashi Road, Peelamedu, Coimbatore 641004, Tamil Nadu, India ' PSGR Krishnammal College for Women, Avinashi Road, Peelamedu, Coimbatore 641004, Tamil Nadu, India

Abstract: Deep neural networks has shown its power in generous classification problems including speech recognition. This paper proposes to enhance the power of deep belief network (DBN) further by pre-training the neural network using particle swarm optimisation (PSO). The objective of this work is to build an efficient acoustic model with deep belief networks for phoneme recognition with much better computational complexity. The result of using PSO for pre-training the network drastically reduces the training time of DBN and also decreases the phoneme error rate (PER) of the acoustic model built to classify the phonemes. Three variations of PSO namely, the basic PSO, second generation PSO (SGPSO) and the new model PSO (NMPSO) are applied in pre-training the DBN to analyse their performance on phoneme classification. It is observed that the basic PSO is performing comparably better to other PSOs considered in this work, most of the time.

Keywords: phoneme recognition; deep neural networks; particle swarm optimisation; acoustic model; Tamil speech recognition; deep learning; deep belief networks.

DOI: 10.1504/IJBIDM.2020.107543

International Journal of Business Intelligence and Data Mining, 2020 Vol.16 No.4, pp.506 - 523

Received: 24 Aug 2017
Accepted: 22 Nov 2017

Published online: 02 Apr 2020 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article