2023
DOI: 10.1145/3615862
|View full text |Cite
|
Sign up to set email alerts
|

A Systematic Collection of Medical Image Datasets for Deep Learning

Abstract: The astounding success made by artificial intelligence in healthcare and other fields proves that AI can achieve human-like performance. However, success always comes with challenges. Deep learning algorithms are data-dependent and require large datasets for training. Many junior researchers faced a lack of data, because of a variety of reasons. Medical image acquisition, annotation, and analysis are costly, and their usage is constrained by ethical restrictions. They also require several other resources, such… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 212 publications
0
9
0
Order By: Relevance
“…In analyzing the performance of the CNN models for image classification, as shown in Fig. 1, a systematic approach was undertaken within a dedicated repository of image datasets of disasters and emergencies [38]. Next, global variables crucial for model training were modified, encompassing parameters such as the generic seed, number of epochs, learning rates, choice of pre-trained base models including EfficientNetB0, B7, V2B0, and V2L, InceptionV3, ResNet50, and VGG19, together with the pre-processing methods and optimization algorithms like Adam and RMSprop [39].…”
Section: Methodsmentioning
confidence: 99%
“…In analyzing the performance of the CNN models for image classification, as shown in Fig. 1, a systematic approach was undertaken within a dedicated repository of image datasets of disasters and emergencies [38]. Next, global variables crucial for model training were modified, encompassing parameters such as the generic seed, number of epochs, learning rates, choice of pre-trained base models including EfficientNetB0, B7, V2B0, and V2L, InceptionV3, ResNet50, and VGG19, together with the pre-processing methods and optimization algorithms like Adam and RMSprop [39].…”
Section: Methodsmentioning
confidence: 99%
“…Conventional deep-learning methods, as used by the majority of studies in this review, require large amounts of annotated data, which is a tedious, time-consuming, and often prohibitively expensive process. Given the complex regulatory and privacy concerns associated with the sharing of medical images, the field currently suffers from an overall lack of high-quality annotated images, as many datasets are not publicly available for research purposes [118,119]. This issue can be seen in many of the studies covered by this review, which were generally performed as retrospective, single-center studies on internal datasets unless otherwise specified.…”
Section: Limitationsmentioning
confidence: 99%
“…Ethical considerations must also be at the forefront of ML deployment, including patient consent, data privacy, and the mitigation of biases in algorithm development to ensure equitable care across diverse populations (98). Another aspect to be considered is the need for large, high-quality datasets, which can possibly be fostered by data sharing in repositories freely accessible by the research community (99). While this practice can accelerate innovation by providing researchers with rich datasets to train more sophisticated and accurate models, it raises important considerations for ethics and safety.…”
Section: Recommendations For Use and Future Perspectivesmentioning
confidence: 99%