Journal of Biomed Research 2022
DOI: 10.46439/biomedres.3.025
|View full text |Cite
|
Sign up to set email alerts
|

Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 42 publications
0
3
0
Order By: Relevance
“…In this work we show how protein Training data bias Biases in training data are reflected in downstream models. Under-represented subgroups can suffer lower accuracy due to insufficient weight in the training data (Buolamwini & Gebru, 2018;Chen et al, 2018;Kleinberg et al, 2022;Shahbazi et al, 2023), and socially undesirable biases in data are often amplified by models (Bolukbasi et al, 2016;Caliskan et al, 2017;Taori & Hashimoto, 2023). Various papers have studied how re-weighting or curating datasets can mitigate these biases (Zhao et al, 2017;Ryu et al, 2017;Tschandl et al, 2018;Yang et al, 2020), even finding that overall performance is improved by over-weighting minority groups and actively increasing diversity in datasets (Gao et al, 2020;Rolf et al, 2021;Lee et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…In this work we show how protein Training data bias Biases in training data are reflected in downstream models. Under-represented subgroups can suffer lower accuracy due to insufficient weight in the training data (Buolamwini & Gebru, 2018;Chen et al, 2018;Kleinberg et al, 2022;Shahbazi et al, 2023), and socially undesirable biases in data are often amplified by models (Bolukbasi et al, 2016;Caliskan et al, 2017;Taori & Hashimoto, 2023). Various papers have studied how re-weighting or curating datasets can mitigate these biases (Zhao et al, 2017;Ryu et al, 2017;Tschandl et al, 2018;Yang et al, 2020), even finding that overall performance is improved by over-weighting minority groups and actively increasing diversity in datasets (Gao et al, 2020;Rolf et al, 2021;Lee et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…To mitigate these risks and select the most suitable solution, we usually carry out an accurate a-priori analysis considering factors such as the specific characteristics of the dataset (i.e., data types), the proportion of missing data and the underlying mechanisms causing the missingness [35]. Indeed, missing data can be classified into three different categories depending on the missing data mechanism.…”
Section: Addressing Missing Datamentioning
confidence: 99%
“…According to the “2022 Cancer Facts & Data,” the United States has over 5 million new cases of skin cancer annually, with the melanoma incidence rate for white non-Hispanic people being 30 times that of other ethnicities. Nonetheless, much current academic research and challenges from the ISIC database mainly focus on the identification of benign and malignant skin tumors (Kleinberg et al, 2022 ), which means they do not satisfy the requirements of the Chinese ethnic group. Secondly, China and other countries experience a high incidence of skin diseases.…”
Section: Introductionmentioning
confidence: 99%