2019
DOI: 10.1007/s12652-019-01303-4
|View full text |Cite
|
Sign up to set email alerts
|

Speech/music classification using visual and spectral chromagram features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(21 citation statements)
references
References 53 publications
0
21
0
Order By: Relevance
“…Skewness is a measure of the asymmetry of a signal and can represent the relative tendency of tonal and nontonal components in the band. For a sequence x of length N, the mean is 􏽐 N n�1 (x n − u) 3 and the standard deviation is σ. e mathematical expression of skewness is as follows:…”
Section: Skewness and Kurtosismentioning
confidence: 99%
See 1 more Smart Citation
“…Skewness is a measure of the asymmetry of a signal and can represent the relative tendency of tonal and nontonal components in the band. For a sequence x of length N, the mean is 􏽐 N n�1 (x n − u) 3 and the standard deviation is σ. e mathematical expression of skewness is as follows:…”
Section: Skewness and Kurtosismentioning
confidence: 99%
“…With the continuous development of Internet technology and the improvement of the technological level, different Internet-based music multimedia began to emerge. Music multimedia is one of the more popular types of digital music, so its automatic classification and recognition has become the focus of domestic scholars' research [1][2][3]. However, how to obtain the specific content source from the initial music multimedia data, lacking the definition of the music content, has become a huge challenge for the current automatic classification of music multimedia because the music multimedia signal belongs to a way of time sequence, which can be concealed according to its concealment.…”
Section: Introductionmentioning
confidence: 99%
“…Table 6 lists all the extracted features along with their statistical measures of mean and standard deviation (STD) for method I (DWT) and method II (EMD). We extracted time domain [ 38 , 39 , 40 , 41 , 42 , 43 , 44 , 45 ], spectral [ 46 , 47 ], fractal and chaos [ 48 , 49 ], chroma [ 50 , 51 ], cepstral [ 52 ], and texture features [ 53 ] and analyzed them statistically.…”
Section: Methodsmentioning
confidence: 99%
“…At present, it is mainly used to combine the recursive K-mean algorithm and the recursive least squares method, which has achieved certain results, but more effective improvements are needed. e above issues will further promote the application of RBF networks in various fields [10].…”
Section: Introductionmentioning
confidence: 99%