2018
DOI: 10.1016/j.patcog.2018.03.025
|View full text |Cite
|
Sign up to set email alerts
|

Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features

Abstract: The variety of event categories and event boundary information have resulted in limited success for acoustic event detection systems. To deal with this, we propose to utilize the long contextual information, low-dimensional discriminant global bottleneck features and category-specific bottleneck features. By concatenating several adjacent frames together, the use of contextual information makes it easier to cope with acoustic signals with long duration. Global and category-specific bottleneck features can extr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
18
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(18 citation statements)
references
References 40 publications
0
18
0
Order By: Relevance
“…Değer et al [15] proposed a classification algorithm based on RF, which solved the problem of poor robustness in face emotion recognition. Xiao et al [38] greatly improved the success rate of acoustic event monitoring system by using RF classifier. Fabris et al [17] used RF to predict gene expression as the brain ages.…”
Section: Introductionmentioning
confidence: 99%
“…Değer et al [15] proposed a classification algorithm based on RF, which solved the problem of poor robustness in face emotion recognition. Xiao et al [38] greatly improved the success rate of acoustic event monitoring system by using RF classifier. Fabris et al [17] used RF to predict gene expression as the brain ages.…”
Section: Introductionmentioning
confidence: 99%
“…Corresponding author: h.phan@kent.ac.uk reduction [22,7]. Particularly, the multitasking approach that jointly performs event detection and event boundary estimation [23,6,24] has demonstrated state-of-the-art performance on different benchmark datasets. In the latter stream, overlapping events are either separated using source separation methods [25,26] prior to detection, or recognized via a selection of local spectral features [11,10,7].…”
Section: Introductionmentioning
confidence: 99%
“…However, there exists a methodological gap between them. Audio events intrinsically possess temporal structures, and tailoring a network's output layer and loss functions for structure modelling has been shown to be efficient for the isolated AED [23,6,24]. However, this capacity has been uncharted for overlapping AED, and it remains questionable how to generalize a network's output layer and tailor its loss functions [20,6] to accommodate arbitrary event overlap, i.e.…”
Section: Introductionmentioning
confidence: 99%
“…SVM have been used with the energy and the signal spectrogram in [18] and [19] for surveillance application, and also in [20] respectively, using a deep architecture (DCNN). Other authors considered also Random Forest [24] using more contextual information.…”
Section: Introductionmentioning
confidence: 99%