Title: An investigation of CNN-LSTM music recognition algorithm in ethnic vocal technique singing
Authors: Fang Dong
Addresses: Conservatory of Music, Baotou Teachers' College, Baotou, 014030, China
Abstract: An HPSS (harmonic-percussive source separation) algorithm that considers both time and frequency features is proposed to address poor performance in music style recognition and classification. A CNN structure is designed, and the influence of its parameters on the recognition rate is studied. To address the weak feature expression and high feature dimensionality of existing CNNs, a deep hash learning method is proposed and combined with LSTM networks to integrate temporal information. The results show that, compared with other models such as GRU+LSTM, the double-layer LSTM model used in the study achieves a higher recognition rate, exceeding 75%. This indicates that combining feature learning with hash encoding learning can achieve higher accuracy. The model is therefore better suited to music style recognition, which supports music information retrieval and improves the classification accuracy of music recognition.
Keywords: music recognition; ethnic vocal music; LSTM; CNN; hash layer.
DOI: 10.1504/IJCSE.2024.141337
International Journal of Computational Science and Engineering, 2024 Vol.27 No.5, pp.505 - 514
Received: 09 Jan 2023
Accepted: 18 Jul 2023
Published online: 09 Sep 2024