2017
DOI: 10.1016/j.procs.2017.06.020
|View full text |Cite
|
Sign up to set email alerts
|

Lanczos kernel based spectrogram image features for sound classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 15 publications
0
5
0
2
Order By: Relevance
“…Greco et al [60] Environmental sound and acoustic scene classification Sound/event credit network [45,47,112] Sound event detection Adaboost Ykhlef et al [174] Sound classification Using the restricted boltzmann machine, DNN was trained Ozer et al [124] Fig. 5 Block diagram of the signal preprocessing stage [119,147] parameters and frequency sub-bands using the WPT were derived from the sub-bands to describe the distribution of wavelet coefficients.…”
Section: Transformersmentioning
confidence: 99%
“…Greco et al [60] Environmental sound and acoustic scene classification Sound/event credit network [45,47,112] Sound event detection Adaboost Ykhlef et al [174] Sound classification Using the restricted boltzmann machine, DNN was trained Ozer et al [124] Fig. 5 Block diagram of the signal preprocessing stage [119,147] parameters and frequency sub-bands using the WPT were derived from the sub-bands to describe the distribution of wavelet coefficients.…”
Section: Transformersmentioning
confidence: 99%
“…To apply an image classification approach, audio data were pre-processed and transformed into image data. For example, features extracted from the Spectrogram image were shown to improve the performance of acoustic event classifications [11]. Besides, Spectrogram images were also being used for rapid speaker recognition and artificial speech detection [12].…”
Section: Figure 1 the Illustration Of Speaker Identification Versus Speaker Verificationmentioning
confidence: 99%
“…In this work, rather than using conventional signal processing to extract features from speech, we proposed to use data transformation to apply the techniques from other domains such as image processing for artificial speech detection. The proposed approach is motivated by the success of deploying image classification techniques to sounds classification and speaker recognition [11]. An ensemble classifier in the form of random forest (RF) was used to generate the artificial speech model.…”
Section: Figure 1 the Illustration Of Speaker Identification Versus Speaker Verificationmentioning
confidence: 99%
“…Bu, veri kümelerinde derin öğrenme modellerinin sınıflandırma performansını kısıtlamıştır. Bu sorunun çözümü, önceden eğitilmiş olarak bilinen bir yöntemle gelmiştir [44][46] [48]. Bu yöntemin arkasındaki fikir, metnin genel temsilini öğrenmek ve çok miktarda etiketlenmemiş metinden oluşan genişletilmiş derlemi eğitmektir.…”
Section: Kelime Temsil Yöntemiunclassified