Multimodal Technologies for Perception of Humans
DOI: 10.1007/978-3-540-69568-4_29
|View full text |Cite
|
Sign up to set email alerts
|

CLEAR Evaluation of Acoustic Event Detection and Classification Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
92
0
2

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 90 publications
(95 citation statements)
references
References 5 publications
1
92
0
2
Order By: Relevance
“…Intraclass variations and the spectral-temporal properties across classes pose great challenges to acoustic event detection. Due to the significant real world applications of AED and the challenges being faced, some campaigns, such as CLEAR [7] and D-CASE [8] [9] have attempted to capture the wide range of variations in the design of the acoustic event detection databases [10][11] [12].…”
Section: Introductionmentioning
confidence: 99%
“…Intraclass variations and the spectral-temporal properties across classes pose great challenges to acoustic event detection. Due to the significant real world applications of AED and the challenges being faced, some campaigns, such as CLEAR [7] and D-CASE [8] [9] have attempted to capture the wide range of variations in the design of the acoustic event detection databases [10][11] [12].…”
Section: Introductionmentioning
confidence: 99%
“…There are some significant techniques for these applications; acoustic event detection (AED) analyzes various types of sounds, e.g., "footsteps," "running water," "music," "voice," to detect specific types of sounds [6], [7], and acoustic scene analysis (ASA) analyzes acoustic scenes such as user activities, e.g., "cooking," "vacuuming," "watching TV," or situations, e.g., "being on the bus," "being in a park," "meeting," on the basis of the information of acoustic events [8]. In this paper, we focus on automatic estimation of acoustic scenes, which enables automatic life-logging, the monitoring of elderly people, or multimedia retrieval.…”
Section: Introductionmentioning
confidence: 99%
“…In the task of acoustic event detection (AED), for example, machines perform poorly when detecting and labeling non-speech events in long audio recordings: none of the systems competing in the CLEAR 2007 AED competition achieved greater than 30% accuracy [Temko et al 2006;Temko 2007;Zhou et al 2007]. This AED task is difficult even for humans, because of the set of events selected and the low SNR, e.g., chair squeak and door creak are difficult to detect, let alone distinguish, in common seminar audio recordings.…”
Section: Introductionmentioning
confidence: 99%