2010 5th IEEE Conference on Industrial Electronics and Applications 2010
DOI: 10.1109/iciea.2010.5515397
|View full text |Cite
|
Sign up to set email alerts
|

Scream detection for home applications

Abstract: Audio signal is an important clue for the situation awareness. It provides complementary information for video signal. For home care, elder care, and security application, screaming is one of the events people (family member, care giver, and security guard) are especially interested in. We present here an approach to scream detection, using both analytic and statistical features for the classification. In audio features, sound energy is a useful feature to detect scream like audio. We adopt the log energy to d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…In [6] they employed analytical features followed by SVM classification for event identification, while [7] utilized MFCC, MPEG-7 features, and Hidden Markov Models (HMMs) for gunshot and scream classification. A parallel GMM classifier network achieved a reported 90% accuracy with an 8% false alarm rate in [8].…”
Section: Literature Surveymentioning
confidence: 99%
“…In [6] they employed analytical features followed by SVM classification for event identification, while [7] utilized MFCC, MPEG-7 features, and Hidden Markov Models (HMMs) for gunshot and scream classification. A parallel GMM classifier network achieved a reported 90% accuracy with an 8% false alarm rate in [8].…”
Section: Literature Surveymentioning
confidence: 99%
“…These results are comparable to the ones obtained by other works that implemented real-time applications of audio processing. Examples include audio processing times for real-time predictions of around 1000 ms in [6], and between 104 to 1360 ms in [22].…”
Section: Prototype Implementationmentioning
confidence: 99%
“…For example, the database presented in [5] includes audios with utterances of emotional speech. Moreover, the databases presented in [6,7] include audios that are labeled with different types of emotions and recorded in low noise environments as houses or closed locations. The work in [8] presented a large database that includes screaming sounds that were artificially overlapped with white additive Gaussian noise.…”
Section: Introductionmentioning
confidence: 99%
“…Automated systems can be used to detect exclamations of panic in an emergency and distinguish them from sounds from people quarreling or talking loudly; these systems are useful in subway systems because they can help station staff detect and react to emergencies quickly [1]. Thus, in the present study, we developed a convolutional neural network (CNN) that classifies and detects panicked vocalizations on the mass rapid transit (MRT) system in Taipei.…”
Section: Introductionmentioning
confidence: 99%