Proceedings of the Detection and Classification of Acoustic Scenes And Events 2019 Workshop (DCASE2019) 2019
DOI: 10.33682/j5zw-2t88
|View full text |Cite
|
Sign up to set email alerts
|

SONYC Urban Sound Tagging (SONYC-UST): A Multilabel Dataset from an Urban Acoustic Sensor Network

Abstract: SONYC Urban Sound Tagging (SONYC-UST) is a dataset for the development and evaluation of machine listening systems for realworld urban noise monitoring. It consists of 3068 audio recordings from the "Sounds of New York City" (SONYC) acoustic sensor network. Via the Zooniverse citizen science platform, volunteers tagged the presence of 23 fine-grained classes that were chosen in consultation with the New York City Department of Environmental Protection. These 23 fine-grained classes can be grouped into eight co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
44
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 46 publications
(44 citation statements)
references
References 0 publications
0
44
0
Order By: Relevance
“…However, the embeddings zi and zj are obtained with a single instantiation of the encoder and projection head (see shared weights in Figure 1). [23,24,25,21,26]). The limited dataset scope implies, however, that the learned representations are not transferable to unrelated datasets or downstream tasks, forcing to conduct an in-domain evaluation.…”
Section: Methodsmentioning
confidence: 99%
“…However, the embeddings zi and zj are obtained with a single instantiation of the encoder and projection head (see shared weights in Figure 1). [23,24,25,21,26]). The limited dataset scope implies, however, that the learned representations are not transferable to unrelated datasets or downstream tasks, forcing to conduct an in-domain evaluation.…”
Section: Methodsmentioning
confidence: 99%
“…For training the downstream classifier from the student embeddings, we leverage SONYC Urban Sound Tagging (SONYC-UST) [22], a fraction of SONYC data annotated through crowdsourcing initiatives on the Zooniverse [23] platform, as the Downstream dataset. It is a multi-label dataset consisting of 3068 annotated 10second audio recordings belonging to 8 classes: engine, machineryimpact, non-machinery-impact, powered-saw, alert-signal, music, human-voice, and dog.…”
Section: ℝ Nmentioning
confidence: 99%
“…Another WASN-based project that has collected real operation acoustic samples is the SONYC (Sounds of New York) project [32]. Bello provides a simplified taxonomy of the sounds of the city by means of a two-level hierarchy, dividing them into eight coarse categories and 23 fine labels [33]. The generated dataset is composed of 2351 recordings in the train split and 443 in the validation split, making a total of 2794 audio samples of 10 s each.…”
Section: Related Workmentioning
confidence: 99%