Open Access Article

Title: Residual-enhanced transformer for affective multi-part music generation

Authors: Wenqi Li

Addresses: Department of Arts, SouthWest Petroleum University, Sichuan 610500, China

Abstract: Music generation demands modelling intricate multi-dimensional sequences while preserving structural coherence and emotional expressiveness. To address transformer's limitations in detail retention, multi-track efficiency, and affective integration, we propose residual enhanced affective music transformer (REAM) with three key innovations: 1) residual dense blocks establishing inter-layer skip connections to enhance feature reuse and maintain fine-grained musical textures; 2) emotion-aware rotary positional encoding that dynamically modulates note relationships based on target sentiment vectors; 3) lightweight residual modules enabling efficient parallel generation of multi-track compositions. Through systematic ablation studies and perceptual evaluations, REAM demonstrates superior performance in both objective reconstruction metrics and subjective musicality assessments. This framework bridges symbolic precision with affective depth, enabling computationally efficient generation of structurally coherent, emotionally controllable multi-instrument music compositions.

Keywords: multi-part music generation; transformer; residual network; emotional modelling; lightweight architecture.

DOI: 10.1504/IJICT.2025.148132

International Journal of Information and Communication Technology, 2025 Vol.26 No.31, pp.88 - 104

Received: 16 Jun 2025
Accepted: 02 Jul 2025

Published online: 26 Aug 2025 *