Music rhythm detection and tracking is an important part of music understanding system and music visualization system. Based on the important position of rhythm in music expression and the wide range of multimedia applications, rhythm extraction has become an important hotspot in computer music analysis. In the field of audio recognition research, deep learning can automatically learn the features of audio and extract the rhythm of music. This paper takes music audio rhythm recognition as the main research object and carries out a series of researches with deep learning GRU neural network as the main technical support. A residual network is introduced into the GRU network model, and it is found that when the residual network is at 50 layers, the model has the highest accuracy for audio rhythm extraction. After adjusting the model parameters through experiments, this paper concludes that the average recognition accuracy of the ResNet_50-GRU model for recognizing the rhythm of the music audio in the MSD, AudioSet, and FMA data sets is 92.5%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.