Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2953
|View full text |Cite
|
Sign up to set email alerts
|

DeepLung: Smartphone Convolutional Neural Network-Based Inference of Lung Anomalies for Pulmonary Patients

Abstract: DeepLung is an end-to-end deep learning based audio sensing and classification framework for lung anomaly (e.g. cough, wheeze) detection for pulmonary patients from streaming audio and inertial sensor data from a chest-held smartphone. We design and develop 1-D and 2-D convolutional neural networks for DeepLung, and train them using the Interspeech 2010 Paralinguistic Challenge features. Two different audio windowing schemes: i) real-time respiration cycle based natural windowing, and ii) static length windowi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…In ongoing work we are extending the dataset, hoping that this will allow more advanced classifiers, such as complex neural architectures, to perform better than the LR baseline [53,54]. We are also continuing our attempts to improve classification performance by including audio captured simultaneously from two stethoscope channels.…”
Section: Discussionmentioning
confidence: 99%
“…In ongoing work we are extending the dataset, hoping that this will allow more advanced classifiers, such as complex neural architectures, to perform better than the LR baseline [53,54]. We are also continuing our attempts to improve classification performance by including audio captured simultaneously from two stethoscope channels.…”
Section: Discussionmentioning
confidence: 99%
“… 70 A study inferred lung anomalies by applying a CNN model to audio and inertial sensor data from smartphones for pulmonary patients. 71 Depression was diagnosed by analyzing voice samples of PD patients with MLP, 74 which showed high prediction accuracy of 77% compared with other prediction models based on machine learning (0.62–0.76). Encoders, meanwhile, have been applied to predict morbidity and mortality.…”
Section: Deep Learning Methods For Prediction and Dimensionality Redu...mentioning
confidence: 99%
“…Ahmed et al 68 , Kim et al 69 , Srivastava et al 70 , Ahmed et al 71 Bidja 72 , Zhang et al 73 MLP 74 , Encoder 75 Integration of multimodal data Jung et al 76 Rutkowski et al 77 , Han et al 78 Dong et al 79 CNN and Encoder 30 , Transformer 80 , DNN (not specified) 81 Biosignals accompanying environmental information Kanjo et al 82 , Mou et al 83 He et al edge. After constructing the two GCN layers, an FC layer of softmax activation was combined.…”
Section: Activity Datamentioning
confidence: 99%
“…This approach is relatively non-intrusive, and can be completely non-contact, requiring only the presence of an acoustic sensing device within a reasonable proximity (primarily governed by the level of background noise) and a device to facilitate the requisite signal processing. Some implementations have demonstrated use of a contact microphone [ 162 , 163 , 164 , 165 ]; however, many modern approaches have adopted the ubiquitous smart-phone for audio capture (and in some instances, also processing) device [ 166 , 167 , 168 , 169 ]. A cough event lasts somewhere between 300 [ 170 ] and 650 ms [ 171 ], and can be described by three distinct phases: the expulsive phase, the intermediate phase, and the voiced phase [ 172 ].…”
Section: Acoustic Detection Of Respiratory Infectionmentioning
confidence: 99%