2022
DOI: 10.1155/2022/4439738
|View full text |Cite
|
Sign up to set email alerts
|

The Classification of Music and Art Genres under the Visual Threshold of Deep Learning

Abstract: Wireless networks are commonly employed for ambient assisted living applications, and artificial intelligence-enabled event detection and classification processes have become familiar. However, music is a kind of time-series data, and it is challenging to design an effective music genre classification (MGC) system due to a large quantity of music data. Robust MGC techniques necessitate a massive amount of data, which is time-consuming, laborious, and requires expert knowledge. Few studies have focused on the d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 17 publications
(17 reference statements)
0
5
0
Order By: Relevance
“…For instance, videos can be classified by genre, scene type, or content theme, while images can be categorised by subject matter or visual style. This classification aids in organising the archives and facilitating targeted retrieval of relevant assets (Zheng 2022). Computer vision algorithms can enhance the quality of audiovisual content by performing tasks such as image restoration, video stabilisation, noise reduction, and resolution enhancement.…”
Section: Computer Visionmentioning
confidence: 99%
See 2 more Smart Citations
“…For instance, videos can be classified by genre, scene type, or content theme, while images can be categorised by subject matter or visual style. This classification aids in organising the archives and facilitating targeted retrieval of relevant assets (Zheng 2022). Computer vision algorithms can enhance the quality of audiovisual content by performing tasks such as image restoration, video stabilisation, noise reduction, and resolution enhancement.…”
Section: Computer Visionmentioning
confidence: 99%
“…These tags can include information about objects, scenes, locations, people, emotions, and actions depicted in the media. This automated tagging process speeds up the archival workflow and ensures consistent and accurate metadata generation across a large volume of assets (Zheng 2022).…”
Section: Machine Learning In Audiovisual Archivesmentioning
confidence: 99%
See 1 more Smart Citation
“…Schedl [40] reviews the music recommendation systems based on deep learning. Zheng et al [41] present a deep learning-enabled music genre classification method, and Majidi et al [42] propose a deep learning-based method for automatic music generation. Yang et al [43] present a deep learning-based method for music content recognition and recommendation system, Lee et al [44], propose a deep metric learning-based music similarity evaluation system, Elbir et al [45], introduce MusicRecNet, a CNN-based model which is trained for music similarity analysis, Yesiler et al [46] and Du et al [47] present deep learning based methods for identifying cover songs, Thome et al [48] presented a CNN-based music similarity-based search engine, to help find similar songs for video production and Dhand et al [49] proposed a system that detects the mood of the user and is further trained with a logistic regression-based model to recommend music based on the user's facial expressions.…”
Section: Deep Learning In Mir Datamentioning
confidence: 99%
“…As methods for learning, multilayer neural networks, such as deep learning, which are currently widely used, are attracting attention due to their amazing prediction performances [32][33][34][35]. Deep learning is an AI technology that automatically extracts and learns features based on large amounts of data [36,37]. It essentially consists of a multilayer neural network and an algorithm that mimics human neurons, and it automatically processes and learns input data and passes them to the next layer, which consists of three or more layers; in these layers, it is possible to deepen the characteristics of the data to be learned using multiple layers of this neural network, which result in deep-learning models with extremely high accuracy, sometimes surpassing human recognition accuracy [38][39][40][41][42][43].…”
Section: Introductionmentioning
confidence: 99%