Article: Spectral and prosodic features-based speech pattern classification Journal: International Journal of Applied Pattern Recognition (IJAPR) 2015 Vol.2 No.1 pp.96 - 110 Abstract: Speech pattern produced by individuals are unique. This uniqueness is due to the accent influenced by individual's native dialect. Prior knowledge of spoken dialect provides valuable information for speaker profiling and incorporating them in the decision parameter can improve the system performance. In this paper, an auto-associative neural network model has been proposed to model intrinsic characteristics of speech features for dialect classification. This paper highlights the sufficiency of few spectral and prosodic features for identification of Hindi dialects. Experimental results show that system performance is the best when both spectral and prosodic features are combined to use as input. In the presence of noise, performance of a conventional ASR starts to degrade. The NOISEX-92 database is used to add white noise to the recorded utterances in the range of 0 dB to 20 dB. This paper evaluates the dialect classification system's performance for SNRs in this range. Inderscience Publishers - linking academia, business and industry through research

Title: Spectral and prosodic features-based speech pattern classification

Authors: Shweta Sinha; Aruna Jain; S.S. Agrawal

Addresses: Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, Ranchi, India ' Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, Ranchi, India ' KIIT Group of Colleges, KIIT Campus, Sohna Road, Gurgaon, Haryana, India

Abstract: Speech pattern produced by individuals are unique. This uniqueness is due to the accent influenced by individual's native dialect. Prior knowledge of spoken dialect provides valuable information for speaker profiling and incorporating them in the decision parameter can improve the system performance. In this paper, an auto-associative neural network model has been proposed to model intrinsic characteristics of speech features for dialect classification. This paper highlights the sufficiency of few spectral and prosodic features for identification of Hindi dialects. Experimental results show that system performance is the best when both spectral and prosodic features are combined to use as input. In the presence of noise, performance of a conventional ASR starts to degrade. The NOISEX-92 database is used to add white noise to the recorded utterances in the range of 0 dB to 20 dB. This paper evaluates the dialect classification system's performance for SNRs in this range.

Keywords: speech pattern identification; dialect classification; auto-associative neural networks; AANNs; feature compression; Hindi dialects; speech features; prosodic features; spectral features; speech patterns; speech classification; native dialects; India.

DOI: 10.1504/IJAPR.2015.068947

International Journal of Applied Pattern Recognition, 2015 Vol.2 No.1, pp.96 - 110

Received: 27 Aug 2014
Accepted: 06 Jan 2015
Published online: 21 Apr 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article

Title: Spectral and prosodic features-based speech pattern classification

Keep up-to-date