Title: A grading evaluation method for English oral pronunciation errors based on deep neural networks
Authors: Jian Sun; Li Zhang; Guanghui Shu
Addresses: School of Foreign Languages, Cangzhou Jiaotong College, Cangzhou, 061199, China ' School of Foreign Languages, Cangzhou Jiaotong College, Cangzhou, 061199, China ' School of Foreign Languages, Cangzhou Jiaotong College, Cangzhou, 061199, China
Abstract: In this paper, a deep neural network-based grading method for English oral pronunciation errors is proposed. First, English oral pronunciation signals are pre-processed and MFCC feature vectors are extracted. Second, Hidden Markov model is used to construct an acoustic model, with deep neural network used to predict the state probability distribution of acoustic feature vectors, replacing the observation probability of the acoustic model. Then, a language model is constructed to obtain the probability of word order, and combined with an acoustic model to build a search network. Viterbi algorithm was used to decode and find the phoneme state sequence. Finally, based on the reference phoneme sequence, the degree of pronunciation errors are calculated, compared with a threshold, and achieved graded evaluation. The results indicate that the AUC value of the proposed method is close to 1, and the F1 value is above 0.95, indicating a high accuracy of the evaluation.
Keywords: spoken English; pronunciation error; graded evaluation; hidden Markov model; deep neural network; DNN.
International Journal of Biometrics, 2026 Vol.18 No.1/2/3, pp.17 - 38
Received: 21 Oct 2024
Accepted: 28 Dec 2024
Published online: 13 Jan 2026 *