2024
DOI: 10.1609/aaai.v38i2.27837
|View full text |Cite
|
Sign up to set email alerts
|

DiffSED: Sound Event Detection with Denoising Diffusion

Swapnil Bhosale,
Sauradip Nag,
Diptesh Kanojia
et al.

Abstract: Sound Event Detection (SED) aims to predict the temporal boundaries of all the events of interest and their class labels, given an unconstrained audio sample. Taking either the split-and-classify (i.e., frame-level) strategy or the more principled event-level modeling approach, all existing methods consider the SED problem from the discriminative learning perspective. In this work, we reformulate the SED problem by taking a generative learning perspective. Specifically, we aim to generate sound temporal bounda… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2025
2025
2025
2025

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 31 publications
(45 reference statements)
0
0
0
Order By: Relevance