ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414385
|View full text |Cite
|
Sign up to set email alerts
|

Contrastive Embeddind Learning Method for Respiratory Sound Classification

Abstract: Respiratory sound classification refers to identifying adventitious sounds from given recordings automatically. Due to the difficulty of collection and the expensive manual annotation, there are only limited samples available, which impacts on learning better models. Meanwhile, a majority of these models do not explicitly encourage intra-class compactness and inter-class separability between the learned embeddings, leading to the difficulty of identifying several samples and a reduced generalization performanc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(11 citation statements)
references
References 9 publications
0
11
0
Order By: Relevance
“…Evaluation Metrics. We report the unweighted average recall (UAR) as the generic classification benchmark instead of accuracy, as UAR can provide fairer evaluation of the models over the four classes than accuracy [9,22] in case of imbalance. It is also common to distinguish abnormal audio samples (i. e., crackles, wheezes, and both) from normal cases.…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation
“…Evaluation Metrics. We report the unweighted average recall (UAR) as the generic classification benchmark instead of accuracy, as UAR can provide fairer evaluation of the models over the four classes than accuracy [9,22] in case of imbalance. It is also common to distinguish abnormal audio samples (i. e., crackles, wheezes, and both) from normal cases.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…As a result, it is difficult for healthcare practitioners to fully trust the predictions if no explanation is available, especially when many respiratory sound classification results still have modest performance (e. g., the average score of around 50.16 % on the ICBHI 2017 dataset [8,2]). Existing works tried to mitigate this problem with data augmentation [9] to feed more data to DNNs. Nevertheless, the interpretability of a model is crucial in high-stake domains such as healthcare [10,11,12].…”
Section: Introductionmentioning
confidence: 99%
“…Evaluation Metrics. We report the unweighted average recall (UAR) as the generic classification benchmark instead of accuracy, as UAR can provide fairer evaluation of the models over the four classes than accuracy [8,20] in case of imbalance. It is also common to distinguish abnormal audio samples (i. e., crackles, wheezes, and both) from normal cases.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…As a result, it is difficult for healthcare practitioners to fully trust the predictions if no explanation is available, especially when many respiratory sound classification results still have modest performance (e. g., the average score of around 50.16 % on the ICBHI 2017 dataset [7,2]). Existing works tried to mitigate this problem with data augmentation [8] to feed more data to DNNs. Nevertheless, the interpretability of a model is crucial in high-stake domains such as healthcare, where mistakes can cause significant damage [9,10].…”
Section: Introductionmentioning
confidence: 99%
“…Feature Extraction Methods plays a key role in the sound analysis frameworks, reducing redundant information and improving the classification accuracy [59]. Various feature extraction methods have been explored in the state-of-the-art works, such as mel-frequency cepstral coefficient (MFCC) [8], [15], [28], [30], [60]- [65], log-mel spectrogram (Log-Mel) [59], [66], [67], mel spectrogram [34], [68]- [70] (Mel), and short-time fourier transform (STFT) spectrogram [31],…”
Section: Classification Framework For Database Quality Evaluationmentioning
confidence: 99%