2023
DOI: 10.1007/s11764-023-01465-3
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning (ML) techniques to predict breast cancer in imbalanced datasets: a systematic review

Arman Ghavidel,
Pilar Pazos
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 80 publications
0
5
0
Order By: Relevance
“…For instance, in breast cancer, machine learning models have been used to predict the likelihood of recurrence and to guide the selection of adjuvant therapy. These models take into account various factors such as tumor size, grade, hormone receptor status, and genomic markers to make their predictions [ 25 , 26 , 27 , 28 , 29 ].…”
Section: Discussionmentioning
confidence: 99%
“…For instance, in breast cancer, machine learning models have been used to predict the likelihood of recurrence and to guide the selection of adjuvant therapy. These models take into account various factors such as tumor size, grade, hormone receptor status, and genomic markers to make their predictions [ 25 , 26 , 27 , 28 , 29 ].…”
Section: Discussionmentioning
confidence: 99%
“…In the domain of machine learning, selecting features is a crucial procedure that boosts the efficacy of models by pinpointing and employing the key features from the data collection [26]. The utilized feature selection method in each of the selected models is shown in TABLE 3.…”
Section: Feature Selectionmentioning
confidence: 99%
“…An imbalanced data problem is a common challenge in predictive models based on clinical datasets because positive cases of a disease or diagnosis are typically part of a minority-class sample [26]. In this case, the majority class samples representing patients with a negative diagnosis are remarkably higher than the minority class samples.…”
Section: Approach To Imbalanced Data Problemmentioning
confidence: 99%
“…Even with large sets of records, the number of ASD cases may be small, resulting in few training examples. Several techniques can be applied, such as collecting more data, over- and under-sampling of existing data, 31 creating synthetic data, eg, with large language models (LLM), 32 or applying techniques such as the Synthetic Minority Over Sampling Technique (SMOTE). 33 …”
Section: Introductionmentioning
confidence: 99%