2020
DOI: 10.1101/2020.11.07.20227504
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Challenges of Deep Learning Methods for COVID-19 Detection Using Public Datasets

Abstract: A large number of studies in the past months have proposed deep learning-based Artificial Intelligence (AI) tools for automated detection of COVID-19 using publicly available datasets of Chest X-rays (CXRs) or CT scans for training and evaluation. Most of these studies report high accuracy when classifying COVID-19 patients from normal or other commonly occurring pneumonia cases. However, these results are often obtained on cross-validation studies without an independent test set coming from a separate dataset… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 43 publications
0
5
0
Order By: Relevance
“…An ensemble of three lightweight pre-trained SqueezeNet, ShuffleNet [116], and EfficientNet-B0 [95] at various depths and consolidates feature maps in diverse abstraction levels [ [8,38,53,67,74,84,88,102]. This brings inherent bias on the algorithms as the model tends to learn the distribution of the data source for binary classification problem [32]. Therefore, these models perform very low when used in practical settings, where the models have to adapt to data from different domains [32].…”
Section: Datasets Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…An ensemble of three lightweight pre-trained SqueezeNet, ShuffleNet [116], and EfficientNet-B0 [95] at various depths and consolidates feature maps in diverse abstraction levels [ [8,38,53,67,74,84,88,102]. This brings inherent bias on the algorithms as the model tends to learn the distribution of the data source for binary classification problem [32]. Therefore, these models perform very low when used in practical settings, where the models have to adapt to data from different domains [32].…”
Section: Datasets Resultsmentioning
confidence: 99%
“…This brings inherent bias on the algorithms as the model tends to learn the distribution of the data source for binary classification problem [32]. Therefore, these models perform very low when used in practical settings, where the models have to adapt to data from different domains [32]. Recently, Morozov et al [66] launched a public chest volumetric CT scan dataset with 1110 COVID-19 related studies (see details in subsection 2.1).…”
Section: Datasets Resultsmentioning
confidence: 99%
“…First, in most investigations, results are obtained using crossvalidation without utilizing an independent test set coming from a separate dataset that may have biases. As a result, deep learning models trained in this way are likely to overfit the distribution of training data when independent test sets are not used and are also prone to learn dataset-specific artifacts rather than the truly generalizable disease characteristics [14]. Second, most studies use only a small dataset with very few COVID-19 samples.…”
Section: Introductionmentioning
confidence: 99%
“…Though the results obtained in the current articles are promising, they exhibit limited scope for use as a CAD tool, as most of the works, especially on x-ray images, have been based on data coming from different sources for two distinct classes (Covid Vs. Normal) [14,75,76,79,[84][85][86][87]. This brings inherent bias on the algorithms as the model tends to learn the distribution of the data source for binary classification problems [74]. Therefore, these models perform very low when used in practical settings, where the models have to adapt to data from different domains [74].…”
Section: Introductionmentioning
confidence: 99%
“…This brings inherent bias on the algorithms as the model tends to learn the distribution of the data source for binary classification problems [74]. Therefore, these models perform very low when used in practical settings, where the models have to adapt to data from different domains [74]. Recently, the authors in [88] launched a public chest volumetric CT scan dataset with 1110 COVID-19 related studies (see details in Section 2.1).…”
Section: Introductionmentioning
confidence: 99%