2021
DOI: 10.46481/jnsps.2021.331
|View full text |Cite
|
Sign up to set email alerts
|

Optimized Breast Cancer Classification using Feature Selection and Outliers Detection

Abstract: Breast cancer is the second most commonly diagnosed cancer in women throughout the world. It is on the rise, especially in developing countries, where the majority of cases are discovered late. Breast cancer develops when cancerous tumors form on the surface of the breast cells. The absence of accurate prognostic models to assist physicians recognize symptoms early makes it difficult to develop a treatment plan that would help patients live longer. However, machine learning techniques have recently been used t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 31 publications
0
9
0
Order By: Relevance
“…As an effect, several outlier detection algorithms are used as pipelines in modelling, mainly to improve the performance and robustness of the model. Several outlier detection techniques, such as isolation forest, LOF, One-class SVM, DBSCAN, K-Means, and others, have been widely used in mining outliers in large datasets in several cases, similarly preventing credit card fraud and data leakage [21], tumor classification, breast cancer detection, patient monitoring through ECG signals [22,23], and studying metabolism [24]. The authors used historical datasets from different sources, such as Kaggle, the ML repository at UCI, and real-world datasets, to analyze and create synthetic datasets for training models.…”
Section: Related Workmentioning
confidence: 99%
“…As an effect, several outlier detection algorithms are used as pipelines in modelling, mainly to improve the performance and robustness of the model. Several outlier detection techniques, such as isolation forest, LOF, One-class SVM, DBSCAN, K-Means, and others, have been widely used in mining outliers in large datasets in several cases, similarly preventing credit card fraud and data leakage [21], tumor classification, breast cancer detection, patient monitoring through ECG signals [22,23], and studying metabolism [24]. The authors used historical datasets from different sources, such as Kaggle, the ML repository at UCI, and real-world datasets, to analyze and create synthetic datasets for training models.…”
Section: Related Workmentioning
confidence: 99%
“…Initially, rule-based management systems were used for creating fraud patterns, but this became too complex for manual analysis and Machine Learning (ML) techniques were adopted. ML techniques have also been applied to solve other complex problems [2][3][4][5][6]. Although neural networks emerged first, they have the limitation of using a black-box model.…”
Section: Related Workmentioning
confidence: 99%
“…Observations that deviate from the distribution's general shape or pattern are called outliers [3]. The relationship between the observed and the dependent variable can be estimated by OLS regression, by minimizing the sum of squares [4].…”
Section: Introductionmentioning
confidence: 99%