2019
DOI: 10.1016/j.eswa.2018.08.052
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the performances of different neural network architectures for the detection of screams and shouts in public transportation

Abstract: HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d'enseignement et de recherche français ou étrangers, des labor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(15 citation statements)
references
References 48 publications
0
14
0
1
Order By: Relevance
“…Environmental sound classification is an interesting problem (Sigtia et al, 2016;Stowell et al, 2015) which has different applications ranging from crime detection (Radhakrishnan et al, 2005) to environmental context aware processing (Chu et al, 2009). Moreover, with the increasing interest in smart cities, IOT devices embedding automatic audio classification can be very useful for urban acoustic monitoring (Mydlarz et al, 2017) like intelligent audio-based surveillance system in public transportation (Laffitte et al, 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Environmental sound classification is an interesting problem (Sigtia et al, 2016;Stowell et al, 2015) which has different applications ranging from crime detection (Radhakrishnan et al, 2005) to environmental context aware processing (Chu et al, 2009). Moreover, with the increasing interest in smart cities, IOT devices embedding automatic audio classification can be very useful for urban acoustic monitoring (Mydlarz et al, 2017) like intelligent audio-based surveillance system in public transportation (Laffitte et al, 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Since this application deals with audio files, the implementation of the LSTM network is a good fit for their excellent performance of time series recurrent classification. The research work presented in [53] shows that the LSTM network has outperformed DNN and convolutional neural networks (CNN) in scream detection in noisy environments. This is similar to our case since fire emergency scenes tend to be extremely noisy.…”
Section: Classifiermentioning
confidence: 99%
“…As noted before the main problem with other approach in the literature of audio event detection is the data labeling step. In [9] the data were manually cross-labeled by two different audio experts. Authors in [10] used two parallel GMM classifiers to discriminate, respectively, between screams and noise, and between gunshots and noise.…”
Section: Kernel Principal Components Analysismentioning
confidence: 99%
“…The units of each layer represent unknown features which explain the data allowing for various level of abstraction and enabling the output layer to discriminate more efficiently the dataset. DNN has been successfully used in the field of automatic speech recognition [9,23].…”
Section: Deep Neural Network (Dnn)mentioning
confidence: 99%
See 1 more Smart Citation