Article: An investigation of CNN-LSTM music recognition algorithm in ethnic vocal technique singing. Journal: International Journal of Computational Science and Engineering (IJCSE), 2024 Vol.27 No.5, pp.505 - 514. Publisher: Inderscience Publishers.

Title: An investigation of CNN-LSTM music recognition algorithm in ethnic vocal technique singing

Authors: Fang Dong

Addresses: Conservatory of Music, Baotou Teachers' College, Baotou, 014030, China

Abstract: A harmonic-percussive source separation (HPSS) algorithm that considers both time and frequency features is proposed to address poor performance in music style recognition and classification. A CNN structure was designed, and the influence of its parameters on the recognition rate was studied. A deep hash learning method is proposed to address the weak feature expression and high feature dimensionality of existing CNNs; it is combined with LSTM networks to integrate temporal information. The results showed that, compared with other models such as GRU+LSTM, the double-layer LSTM model used in the study achieved higher recognition accuracy, exceeding 75%. This indicates that combining feature learning with hash encoding learning can achieve higher accuracy. The model is therefore better suited to music style recognition, which supports music information retrieval and improves the classification accuracy of music recognition.
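
The abstract does not spell out how the separation step uses time and frequency features. As a rough illustration only, the sketch below follows the widely used median-filtering formulation of HPSS, which may differ from the variant proposed in the article: harmonic content is smooth along time, percussive content is smooth along frequency, and soft masks are built from the two filtered spectrograms. The function names, the kernel length of 17 and the squared-magnitude masks are assumptions made for this example.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def median_filter_1d(spec, size, axis):
    """Sliding median of odd length `size` along `axis`, with edge padding."""
    pad = size // 2
    pad_width = [(0, 0), (0, 0)]
    pad_width[axis] = (pad, pad)
    padded = np.pad(spec, pad_width, mode="edge")
    # Windows are appended as a trailing axis; the output keeps spec's shape.
    windows = sliding_window_view(padded, size, axis=axis)
    return np.median(windows, axis=-1)


def hpss_masks(magnitude, kernel=17, eps=1e-10):
    """Return (harmonic_mask, percussive_mask) for a magnitude spectrogram.

    magnitude: non-negative array of shape (freq_bins, frames).
    kernel: odd median-filter length (hypothetical default).
    """
    harm = median_filter_1d(magnitude, kernel, axis=1)  # smooth along time
    perc = median_filter_1d(magnitude, kernel, axis=0)  # smooth along frequency
    total = harm ** 2 + perc ** 2 + eps
    return harm ** 2 / total, perc ** 2 / total


# Toy spectrogram: one sustained tone (row 10) and one click (column 30).
spec = np.zeros((64, 64))
spec[10, :] = 1.0
spec[:, 30] = 1.0
mask_h, mask_p = hpss_masks(spec)
```

In a full pipeline the masks would be applied to the complex STFT and each result inverted back to a waveform; the STFT stage is omitted here to keep the sketch dependent on NumPy alone.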

Keywords: music recognition; ethnic vocal music; LSTM; CNN; hash layer.

DOI: 10.1504/IJCSE.2024.141337

International Journal of Computational Science and Engineering, 2024 Vol.27 No.5, pp.505 - 514

Received: 09 Jan 2023
Accepted: 18 Jul 2023
Published online: 09 Sep 2024 *
