ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053972
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Label Sound Event Retrieval Using A Deep Learning-Based Siamese Structure With A Pairwise Presence Matrix

Abstract: Realistic recordings of soundscapes often have multiple sound events co-occurring, such as car horns, engine and human voices. Sound event retrieval is a type of contentbased search aiming at finding audio samples, similar to an audio query based on their acoustic or semantic content. State of the art sound event retrieval models have focused on single-label audio recordings, with only one sound event occurring, rather than on multi-label audio recordings (i.e., multiple sound events occur in one recording). T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Using this data-set, methods for both detection [35] and classification [36] of acoustic scenes and events have been carried out. Recently, a deep learning structure has been developed with this data-set for sound event retrieval [37] of urban sound events, such as car horns and human speech, on multi-label audio recordings.…”
Section: Machine Learning For Analysis Of Environmental Acousticsmentioning
confidence: 99%
“…Using this data-set, methods for both detection [35] and classification [36] of acoustic scenes and events have been carried out. Recently, a deep learning structure has been developed with this data-set for sound event retrieval [37] of urban sound events, such as car horns and human speech, on multi-label audio recordings.…”
Section: Machine Learning For Analysis Of Environmental Acousticsmentioning
confidence: 99%