2017
DOI: 10.1121/1.4989346
|View full text |Cite
|
Sign up to set email alerts
|

Creation of a corpus of realistic urban sound scenes with controlled acoustic properties

Abstract: Sound source detection and recognition using acoustic sensors are increasingly used to monitor and analyze the urban environment as they enhance soundscape characterization and facilitate the comparison between simulated and measured noise maps using methods such as Artificial Neural Networks or Non-negative Matrix Factorization. However, the community lacks corpuses of sound scenes whose acoustic properties of each source present within the scene are precisely known. In this study, a set of 40 sound scenes ty… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…The original recordings range from 55 s to 4.5 mn in duration. In [43], the extracts are manually annotated and classified in terms of ambiance (park, quiet street, noisy street and very noisy street). The available annotations include:…”
Section: Stimulimentioning
confidence: 99%
See 2 more Smart Citations
“…The original recordings range from 55 s to 4.5 mn in duration. In [43], the extracts are manually annotated and classified in terms of ambiance (park, quiet street, noisy street and very noisy street). The available annotations include:…”
Section: Stimulimentioning
confidence: 99%
“…The 45 s segments are selected to represent the properties of their respective ambiances in terms of source composition, without single events overwhelming their overall perception. The manual annotations available in [43] of background and event information are then used to replicate the sound scenes using simScene in the replicate mode, using the same isolated samples database as for the 75 simulated scenes. The 19 resulting sound scenes are thus simulated, but their scenarios follow those of the reference recordings to the extent of annotation precision.…”
Section: Stimulimentioning
confidence: 99%
See 1 more Smart Citation
“…where More details on the results can be found in [53]. As the perceived realism of the replicated and the recorded scenes are not significantly different, we consider that these sound mixtures are relevant to assess the performances of NMF according to the traffic sound level estimate.…”
Section: Generation Of the Evaluation Corpusmentioning
confidence: 99%
“…Moreover, the existing corpora are either poorly annotated, either unrealistic or too complex. Therefore, in the case of a realistic sound mixture, the acoustic properties of the sound are hard to model [8]. The use of audio data from different recording points offers unprecedented opportunities for a relevant pattern of audio events.…”
Section: Audio Datasetsmentioning
confidence: 99%