Open Access Article

Title: Lightweight CNN-transformer hybrid network for English speech recognition

Authors: Yan Li; Weiguo Huang; Cui Gu

Addresses: School of Foreign Languages, Hunan University of Arts and Science, Changde 415000, Hunan, China; School of Information Engineering, Hunan University of Science and Engineering, Yongzhou 425199, Hunan, China; Department of Academic Affairs, Changde College, Changde 415000, Hunan, China

Abstract: Speech recognition is a core technology for human-computer interaction, and English speech recognition has particularly high practical value in global communication scenarios. CNN-based speech recognition models are good at extracting local features but cannot effectively capture global semantics. Transformer-based models outperform CNNs at extracting global semantics, but their large parameter counts and high computational complexity make them difficult to deploy and run on resource-constrained devices. Motivated by this trade-off, we propose a lightweight CNN-transformer hybrid network (LwCTHNet) for English speech recognition. LwCTHNet integrates local feature extraction, frequency-domain detail supplementation, and global semantic capture by alternately stacking 3 × 3 convolution layers, wavelet-enhanced convolution modules, and lightweight transformer modules. It also achieves multi-scale feature learning through skip connections and enhances feature discriminability with a mixed loss function that combines cross-entropy loss and contrastive loss. Experimental results on three English speech recognition datasets show that the proposed method has the smallest parameter size while achieving a near-optimal word error rate. This indicates that LwCTHNet strikes a good balance among recognition performance, computational complexity, and parameter size.
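
The abstract reports results as word error rate (WER). As a reminder of what that metric measures, here is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer's hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the abstract does not describe the authors' evaluation script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: tokenization is a plain whitespace split, which is
    an assumption and not taken from the paper.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so
    # far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.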

Keywords: lightweight model; English speech recognition; transformer; multi-scale feature learning.

DOI: 10.1504/IJBIDM.2026.153309

International Journal of Business Intelligence and Data Mining, 2026 Vol.28 No.7, pp.1-22

Received: 17 Sep 2025
Accepted: 13 Jan 2026

Published online: 01 May 2026