2019
DOI: 10.3390/s19143206

Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF

Abstract: Sound event detection in real-world environments suffers from the interference of non-stationary and time-varying noise. This paper presents an adaptive noise reduction method for sound event detection based on non-negative matrix factorization (NMF). First, a scheme for noise dictionary learning from the input noisy signal is employed by the technique of robust NMF, which supports adaptation to noise variations. The estimated noise dictionary is used to develop a supervised source separation framework in comb…

Cited by 24 publications (19 citation statements)
References 31 publications
“…Whereas, for Pooling Size in Table 9, the value indicates the pooling size at each layer. Thus, a pooling size of (1, 5), (1, 4), (1, 2) represent that the T-F input only has its frequency dimension reduced by 5 times in the first layer, further reduced by another 4 times in the second layer and another 2 times in the last layer. Using a 40 mel bands T-F input as an example, the 40 bands become 1 band in 3 stages: 40 → 8 → 2 → 1.…”
Section: Summary Of Hybrid Models (mentioning)
confidence: 99%
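
The band arithmetic in the statement above can be checked with a short sketch. It assumes non-overlapping pooling (stride equal to the pooling size) and a (time, frequency) ordering of each pooling tuple; the function name `pooled_sizes` is hypothetical and does not come from the cited paper.

```python
def pooled_sizes(n_bands, pool_sizes):
    """Return the number of frequency bands after each pooling layer.

    Assumes non-overlapping pooling, so each layer divides the band
    count by its frequency pooling factor (floor division).
    """
    sizes = [n_bands]
    for _time_pool, freq_pool in pool_sizes:
        sizes.append(sizes[-1] // freq_pool)
    return sizes


# Pooling sizes quoted in the statement: (1, 5), (1, 4), (1, 2)
stages = pooled_sizes(40, [(1, 5), (1, 4), (1, 2)])
print(" → ".join(str(s) for s in stages))  # 40 → 8 → 2 → 1
```

Because the time pooling factor is 1 in every layer, the time resolution is untouched and only the frequency axis collapses.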
“…After a specific training epoch, the training procedure transit into the second stage where the combined cost was added with another component, $C_4$, which can be represented as…”
Section: Figure 14 Flowchart Of Lin Et Al Framework [114] (mentioning)
confidence: 99%
“…NMF aims at finding two non-negative matrices $W$ and $H$ to roughly approximate the original data matrix $X$, i.e., $X \approx WH$. Since many real-world datasets are usually high-dimensional, the NMF methods have been widely applied in image analysis [40], data mining, speech denoising [41] and population genetics, etc. The semi-NMF [42] is an extension of traditional NMF, which only requires the coefficient matrix to be non-negative.…”
Section: Related Work (mentioning)
confidence: 99%
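
As a rough illustration of the factorization described in the statement above, the following is a minimal sketch of plain NMF using the standard multiplicative updates for the Frobenius-norm objective. It is not the robust or subband-weighted variant of the paper under discussion; the symbols `X`, `W`, `H` and the function name `nmf` are assumptions made for this example, and the data is synthetic.

```python
import numpy as np


def nmf(X, rank, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix X (n x m) as W @ H.

    W is n x rank and H is rank x m, both non-negative. Uses
    multiplicative updates that keep entries non-negative as long as
    the initial values are non-negative.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    for _ in range(n_iter):
        # Update coefficients H, then basis W; eps avoids division by zero.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Synthetic non-negative data with an exact rank-3 structure.
data_rng = np.random.default_rng(1)
X = data_rng.random((20, 3)) @ data_rng.random((3, 30))

W, H = nmf(X, rank=3)
rel_err = np.linalg.norm(X - W @ H) / np.linalg.norm(X)
print(f"relative reconstruction error: {rel_err:.4f}")
```

In a denoising setting, the columns of `W` would typically play the role of a dictionary (for example, spectral templates) and `H` the activations over time; semi-NMF, as the statement notes, relaxes the non-negativity constraint on one of the two factors.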
“…Sound event classification (SEC) has vastly been exploited by many researchers. Sound can be categorized into speech, music, noise, environmental sound, or daily living sound [26]. Sound events are available in all classes, for example, car horn, traffic, walking, or knocking, etc.…”
Section: Introduction (mentioning)
confidence: 99%