2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)
DOI: 10.1109/iwaenc.2018.8521367

Using Sequential Information in Polyphonic Sound Event Detection

citations
Cited by 8 publications
(11 citation statements)
references
References 14 publications
“…In SED, this results in letting the RNN learn a language model over the sound events, e.g., which sound events are more likely to happen together and/or in sequence, or how likely is a sound event to keep being active, given the previous activity of the sound events. Teacher forcing is different from what was proposed in [13], as the latter approach conditioned the DNN (not the RNN) with the class activities: such an approach yielded poor results, intuitively explained by having y_{t-1} dominated by the information in X through the sequence of the CNN blocks.…”
Section: Teacher Forcing (mentioning)
confidence: 93%
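
The teacher-forcing idea quoted above can be illustrated with a minimal sketch. It is not the citing paper's architecture: the single tanh recurrent layer, the weight names (`W_x`, `W_y`, `W_h`, `W_o`), and the 0.5 threshold are all assumptions made for illustration. The only point shown is that the recurrent step receives the previous class activities y_{t-1}, taken from the ground truth during training and from the model's own thresholded output at inference.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def run_rnn(X, params, Y_true=None):
    """Run a toy recurrent layer over T frames.

    X      : (T, F) acoustic features per frame.
    params : (W_x, W_y, W_h, W_o) with shapes (F, H), (C, H), (H, H), (H, C).
             These names and shapes are hypothetical.
    Y_true : (T, C) binary class activities, or None.
             If given, y_{t-1} is the ground truth (teacher forcing);
             otherwise it is the model's own thresholded prediction.
    Returns a (T, C) array of per-class activity probabilities.
    """
    W_x, W_y, W_h, W_o = params
    T = X.shape[0]
    C = W_o.shape[1]
    h = np.zeros(W_h.shape[0])
    y_prev = np.zeros(C)  # no event is assumed active before the first frame
    outputs = []
    for t in range(T):
        # The recurrent step is conditioned on the previous class activities.
        h = np.tanh(X[t] @ W_x + y_prev @ W_y + h @ W_h)
        y_hat = sigmoid(h @ W_o)
        outputs.append(y_hat)
        if Y_true is not None:
            y_prev = Y_true[t]                    # teacher forcing
        else:
            y_prev = (y_hat > 0.5).astype(float)  # free-running inference
    return np.stack(outputs)
```

Calling `run_rnn(X, params, Y_true)` corresponds to the training-time regime and `run_rnn(X, params)` to inference; both produce the same output for the first frame because y_{t-1} is initialised to zeros in either case.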
“…Finally, we compare our method to the best results presented in [13] which are obtained by employing N-grams as a postprocessing to learn a language model. We report the results of this method on the TUT Sound Events 2016 datasets, as these are the only ones in the corresponding paper that are based on a publicly available dataset.…”
Section: Baseline (mentioning)
confidence: 99%
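
As a rough illustration of what an N-gram language model over event activity can look like, the sketch below estimates a bigram (N = 2) model for the frame-wise binary activity of a single event class by counting transitions. This is an assumption-laden toy, not the post-processing procedure of [13]: the function name `bigram_transitions`, the additive smoothing constant `alpha`, and the single-class setting are all hypothetical.

```python
import numpy as np

def bigram_transitions(activity, alpha=1.0):
    """Estimate P(y_t = j | y_{t-1} = i) for a binary activity sequence.

    activity : sequence of 0/1 values, one per frame (hypothetical input).
    alpha    : additive smoothing applied to every transition count.
    Returns a (2, 2) matrix whose row i is the distribution over y_t
    given y_{t-1} = i.
    """
    counts = np.full((2, 2), float(alpha))
    for prev, cur in zip(activity[:-1], activity[1:]):
        counts[int(prev), int(cur)] += 1
    # Normalise each row so it sums to one.
    return counts / counts.sum(axis=1, keepdims=True)
```

In the returned matrix, the entry at row 1, column 1 is the estimated probability that an active event stays active in the next frame, which is the kind of sequential regularity a language model over sound events is meant to capture.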