2020
DOI: 10.1109/access.2020.3017661
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Level Local Feature Coding Fusion for Music Genre Recognition

Abstract: Music genre recognition (MGR) plays a fundamental role in the context of music indexing and retrieval. Unlike images, music genres consist of immediate characteristics that are highly diversified with abstractions in different levels. However, most representation learning methods for MGR focus on global features and make decisions from features in the same level. To remedy such defects, we intergrate a convolutional neural network (CNN) with NetVLAD and self-attention to capture the local information across le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 39 publications
0
13
0
Order By: Relevance
“…The GTZAN Genre mainly divides music genres into 10 categories: blues, country, hip-hop, jazz, pop, disco, classical, rock, reggae, and metal. The ISMIR2004 Genre mainly divides music into six genres: classical, electronic, jazz/blues, metal/punk, and rock/pop ( Ng et al, 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…The GTZAN Genre mainly divides music genres into 10 categories: blues, country, hip-hop, jazz, pop, disco, classical, rock, reggae, and metal. The ISMIR2004 Genre mainly divides music into six genres: classical, electronic, jazz/blues, metal/punk, and rock/pop ( Ng et al, 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…The recognition accuracy is improved by using the information obtained from the offline data set, including RGB information and available depth and bone information. Ng et al proposed an optimization method based on machine learning to match dance technical movements and music in music arrangement [ 19 ]. First, combined with machine learning theory, this method constructs the mapping relationship between dance action and music based on historical sample data set and obtains the evaluation function of the harmonious relationship between dance action and music.…”
Section: Related Workmentioning
confidence: 99%
“…The experimental data are from the dance action capture database provided by Carnegie Mellon University Laboratory, and the length of each music segment is about 5 seconds. Using the methods of this study and references [ 17 ] and [ 19 ], the optimization experiment of dance action matching in music choreography is carried out, and the matching degrees (%) of the three methods are compared. Figure 7 shows the comparison results of dance movements in music arrangement.…”
Section: Simulation Analysis Of Matching Optimization Based On Multif...mentioning
confidence: 99%
“…This makes the problem richer and more suitable for deep models, which require larger amounts of data. Different types of deep learning models have been applied to audio classification problems, such as Deep Belief Networks (DBNs), Restricted Boltzmann Machines (RBMs), Convolutional neural networks (CNNs) [24] and Recurrent neural networks (RNNs) [25]- [27].…”
Section: Fully Connected Neural Network (Fcnns)mentioning
confidence: 99%