Authors: Guohua Hu; Guoyan Meng; Qingshan Zhao; Xiaoxia Zheng
Addresses: Department of Computer, Xinzhou Teachers University, Xinzhou, Shanxi, 034000, China ' Department of Mathmatics, Xinzhou Teachers university Xinzhou, Shanxi, 034000, China ' Department of Computer, Xinzhou Teachers University, Xinzhou, Shanxi, 034000, China ' Department of Computer, Xinzhou Teachers University, Xinzhou, Shanxi, 034000, China
Abstract: The interference and redundancy of speech emotional features will directly affect the recognition performance of emotional features. In order to enhance the ability of emotional features to recognise speech emotion, dimension reduction method was used to optimise emotional features. Four emotion (sad, angry, happy and neutral) voices were selected from Berlin speech database and CASIA corpus, and traditional emotion features (prosodic features, formant features and mel frequency cepstrum coefficient (MFCC) features) were extracted. In order to reduce the mutual interference between features, principal component analysis (PCA) was used to extract the principal components of features to get independent features. Meanwhile, in order to get the emotional features that are highly related to the emotional type, grey relational analysis (GRA) was used to select the main features from the principal components and design experiments for comparison. The experimental results show that PCA-GRA dimensionality reduction method can reduce the correlation of features and get the feature set with small redundancy, so as to improve the recognition effect of speech emotion.
Keywords: emotional speech; emotion recognition; feature optimisation; PCA; principal component analysis; GRA; grey relational analysis.
International Journal of Computing Science and Mathematics, 2020 Vol.12 No.4, pp.339 - 349
Received: 07 Apr 2020
Accepted: 15 Jun 2020
Published online: 26 Jan 2021 *