CNN-MoE Based Framework for Classification of Respiratory Anomalies and Lung Disease Detection

Pham, Lam; Phan, Huy; Palaniappan, Ramaswamy; Mertins, Alfred; McLoughlin, Ian

doi:10.1109/jbhi.2021.3064237

Cited by 101 publications

(58 citation statements)

References 36 publications

“…The accomplishments of the present study are the following: (i) the hybrid CNN-LSTM approach provides the best combination of performance (sensitivity, specificity, score, accuracy) in comparison with all previous relevant studies (including Pham et al [ 30 ] as results using random 5-fold CV are not reliable), (ii) the proposed model performs well for a highly imbalanced dataset, (iii) the FL function delivers better results than the classic CE function for ECG classification, and (iv) the proposed method could be used for real-time lung sound classification as the prediction phase lasts only a few seconds.…”

Section: Results (mentioning)

confidence: 86%

“…At the same time, Ma et al [ 28 ] introduced a non-local (NL) block into a ResNet and used STFT features for lung sound classification. Yang et al [ 29 ] analyzed STFT features with a ResNet with squeeze and excitation (SE) and spatial attention (SA) blocks for the identification of abnormal lung sounds, while another study by Pham et al [ 30 ] implemented a mixture-of-experts (MoE) block into a CNN structure and used mel spectrogram, gammatone-based spectrogram, MFCC and rectangular constant Q transform (CQT) features for the same purpose. Lastly, Nguyen and Pernkopf [ 31 ] implemented a ResNet to process mel spectrograms and classify respiratory sounds into four different categories.…”

Section: Related Work (mentioning)

confidence: 99%
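
The mixture-of-experts (MoE) block mentioned in the Related Work statement above can be illustrated with a minimal sketch. The statement does not describe the block's internals, so everything here is an assumption: each expert and the gate are single linear layers, the gate output passes through a softmax, and the block returns the gate-weighted sum of the expert outputs. The names `dense` and `moe_block` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dense(x, weights, bias):
    """Linear layer: one output per (weight row, bias) pair."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def moe_block(x, experts, gate):
    """Gate-weighted sum of expert outputs (assumed design, not the paper's).

    experts: list of (weights, bias) pairs, all with the same output size.
    gate:    (weights, bias) pair producing one logit per expert.
    """
    gate_weights, gate_bias = gate
    g = softmax(dense(x, gate_weights, gate_bias))   # one weight per expert
    outputs = [dense(x, w, b) for w, b in experts]   # each expert sees x
    n_out = len(outputs[0])
    return [sum(g[k] * outputs[k][j] for k in range(len(outputs)))
            for j in range(n_out)]


# Two toy experts: identity and coordinate swap.
experts = [
    ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),
    ([[0.0, 1.0], [1.0, 0.0]], [0.0, 0.0]),
]
# A gate with zero weights and zero bias weights both experts equally.
uniform_gate = ([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0])
mixed = moe_block([1.0, 2.0], experts, uniform_gate)
```

With a uniform gate the output is the plain average of the expert outputs; a trained gate would instead shift weight toward the experts that suit each input.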

Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function

Petmezas

Cheimariotis

Stefanopoulos

et al. 2022

Sensors

Respiratory diseases constitute one of the leading causes of death worldwide and directly affect the patient’s quality of life. Early diagnosis and patient monitoring, which conventionally include lung auscultation, are essential for the efficient management of respiratory diseases. Manual lung sound interpretation is a subjective and time-consuming process that requires high medical expertise. The capabilities that deep learning offers could be exploited in order that robust lung sound classification models can be designed. In this paper, we propose a novel hybrid neural model that implements the focal loss (FL) function to deal with training data imbalance. Features initially extracted from short-time Fourier transform (STFT) spectrograms via a convolutional neural network (CNN) are given as input to a long short-term memory (LSTM) network that memorizes the temporal dependencies between data and classifies four types of lung sounds, including normal, crackles, wheezes, and both crackles and wheezes. The model was trained and tested on the ICBHI 2017 Respiratory Sound Database and achieved state-of-the-art results using three different data splitting strategies—namely, sensitivity 47.37%, specificity 82.46%, score 64.92% and accuracy 73.69% for the official 60/40 split, sensitivity 52.78%, specificity 84.26%, score 68.52% and accuracy 76.39% using interpatient 10-fold cross validation, and sensitivity 60.29% and accuracy 74.57% using leave-one-out cross validation.
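
The abstract above names the focal loss (FL) as its remedy for training data imbalance but does not state the formula. The sketch below uses the commonly cited form FL(p_t) = -(1 - p_t)^γ log(p_t), without class weighting. The default `gamma=2.0` and the function names are assumptions and are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target):
    """Standard cross-entropy (CE) loss for a single example."""
    return -math.log(softmax(logits)[target])


def focal_loss(logits, target, gamma=2.0):
    """Focal loss for a single example; gamma=2.0 is an assumed default.

    The factor (1 - p_t) ** gamma shrinks the loss of examples the model
    already classifies confidently, so hard examples dominate training.
    """
    p_t = softmax(logits)[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


# Four classes, as in the task described above; the logits are made up.
logits = [5.0, 0.0, 0.0, 0.0]
easy_ce, easy_fl = cross_entropy(logits, 0), focal_loss(logits, 0)
hard_ce, hard_fl = cross_entropy(logits, 1), focal_loss(logits, 1)
```

Setting `gamma=0` recovers the cross-entropy loss, which makes the two losses easy to compare under one code path.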

“… | Test Cond. | Specificity (%) | Sensitivity (%) | Score (%)
GMM-HMM [12] | original split (60/40) | -- | -- | 39.56
Decision Tree [13] | original split (60/40) | 75 | 12 | 43
CNN-MoE [19] | original split (60/40) | 68 | 26 | 47
VGG-16 (two path) [16] | …

backbone but were beyond the SE-ResNet. For specificity values, the reverse applied.…”

Section: System (mentioning)

confidence: 99%
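
The Score column in the statement above is not defined there. A minimal sketch, assuming the score is the arithmetic mean of sensitivity and specificity, which matches the quoted CNN-MoE row (68% and 26% give 47%); the function name `icbhi_score` is hypothetical.

```python
def icbhi_score(sensitivity, specificity):
    """Mean of sensitivity and specificity, both given in percent.

    Assumption: this is how the quoted Score column is derived.
    """
    return (sensitivity + specificity) / 2.0


# CNN-MoE row from the quoted comparison: specificity 68%, sensitivity 26%.
cnn_moe_score = icbhi_score(sensitivity=26.0, specificity=68.0)
print(cnn_moe_score)  # prints 47.0
```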

Respiratory Sound Classification: From Fluid-Solid Coupling Analysis to Feature-Band Attention

et al. 2022

Based on respiratory sound production mechanisms, we study the relationship between airflow characteristics in the bronchi and the sound pressure spectrum curves to implement an end-to-end respiratory sound classification system with a feature-band attention module. First, we analyse fluid-solid coupling simulations of the bronchi and execute acoustic simulations to obtain spectrum curves of the bronchi at the sound pressure level. Then, based on the spectrum characteristics of the bronchi, we propose an attention strategy to refine acoustic features with adaptive weights. In addition, we introduce a feature-band attention module to ResNet-based networks with a squeeze-and-excitation block. Finally, we perform experiments on the ICBHI public database to classify respiratory sounds as belonging to one of four classes: normal, wheezes, crackles, and both (wheezes and crackles). The results show that our proposed system produces superior performance compared to the baseline system. This type of feature learning strategy is useful for exploring distinct characteristics of different types of respiratory sounds.

INDEX TERMS: fluid-solid coupling, attention learning, end-to-end system, respiratory sound classification, squeeze-and-excitation.

“…Furthermore, for these frameworks to be compatible with real time portable or wearable computational devices. This contribution is published in the 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society[32] and being considered for publication in IEEE Journal of Biomedical and Health Informatics[33], the 43th Annual International Conferences of the IEEE Engineering in Medicine and Biology Society[34]…”

mentioning

confidence: 99%

“…With the presence of this knowledge distillation, training the student network, therefore, aims to minimize two losses: (1) the Euclidean distance $\mathrm{LOSS}_{EU}$ between the teacher and student embedding, and (2) the standard cross-entropy loss $\mathrm{LOSS}_{EN}$ on the student's classification output. The combined loss function is therefore $\mathrm{LOSS} = (1 - \gamma)\,\mathrm{LOSS}_{EN} + \gamma\,\mathrm{LOSS}_{EU}$ (33)…”

mentioning

confidence: 99%
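
The combined distillation loss in the statement above can be sketched for a single training example as follows. The statement gives only the weighting between the two terms, so the rest is assumed: the Euclidean distance is taken unsquared, the embeddings are plain lists of floats, and the default `gamma=0.5` is a placeholder rather than a value from the source.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy on the student's classification output (LOSS_EN)."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def euclidean_distance(a, b):
    """Unsquared Euclidean distance between two embeddings (LOSS_EU)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def combined_loss(student_logits, target, student_emb, teacher_emb, gamma=0.5):
    """LOSS = (1 - gamma) * LOSS_EN + gamma * LOSS_EU; gamma is assumed."""
    loss_en = cross_entropy(student_logits, target)
    loss_eu = euclidean_distance(student_emb, teacher_emb)
    return (1.0 - gamma) * loss_en + gamma * loss_eu


# Made-up values: the embeddings differ by a 3-4-5 triangle, distance 5.
student_logits = [1.0, 0.2, -0.5, 0.1]
loss = combined_loss(student_logits, 0, [0.0, 0.0], [3.0, 4.0], gamma=0.5)
```

Setting `gamma` to 0 trains the student on its labels alone, while `gamma` of 1 makes it match the teacher embedding only.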

Robust Deep Learning Frameworks For Acoustic Scene and Respiratory Sound Classification

Pham

2021

Preprint

Although research on Acoustic Scene Classification (ASC) is very close to, or even overshadowed by different popular research areas known as Automatic Speech Recognition (ASR), Speaker Recognition (SR) or Image Processing (IP), this field potentially opens up several distinct and meaningful application areas based on environment context detection. The challenges of ASC mainly come from different noise resources, various sounds in real-world environments, occurring as single sounds, continuous sounds or overlapping sounds. In comparison to speech, sound scenes are more challenging mainly due to their being unstructured in form and closely similar to noise in certain contexts. Although a wide range of publications have focused on ASC recently, they show task-specific ways that either explore certain aspects of an ASC system or are evaluated on limited acoustic scene datasets. Therefore, the aim of this thesis is to contribute to the development of a robust framework to be applied for ASC, evaluated on various recently published datasets, and to achieve competitive performance compared to the state-of-the-art systems.
