2021
DOI: 10.1007/978-3-030-86230-5_44
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Machine Learning Methods for Extremely Unbalanced Industrial Quality Data

Abstract: The Industry 4.0 revolution is impacting manufacturing companies, which need to adopt more data intelligence processes in order to compete in the markets they operate. In particular, quality control is a key manufacturing process that has been addressed by Machine Learning (ML), aiming to improve productivity (e.g., reduce costs). However, modern industries produce a tiny portion of defective products, which results in extremely unbalanced datasets. In this paper, we analyze recent big data collected from a ma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…Within the scope of this research, the classification framework has been constructed, employing algorithms of distinct methodological underpinnings. Specifically, the selected algorithms include Naive Bayes, esteemed for its probabilistic foundation, alongside the ensemble-based Random Forest, and the individually decisive Decision Tree, each chosen for their unique attributes and relevance to the study's objectives [23][24][25][26][27].…”
Section: Model Selectionmentioning
confidence: 99%
“…Within the scope of this research, the classification framework has been constructed, employing algorithms of distinct methodological underpinnings. Specifically, the selected algorithms include Naive Bayes, esteemed for its probabilistic foundation, alongside the ensemble-based Random Forest, and the individually decisive Decision Tree, each chosen for their unique attributes and relevance to the study's objectives [23][24][25][26][27].…”
Section: Model Selectionmentioning
confidence: 99%
“…For all the datasets, Random UnderSampling (RUS) [69,70] is applied to balance the class distribution by making the cardinality of the majority class comparable to that of the minority class. RUS has demonstrated its effectiveness in diverse fields where the class imbalance is common, such as astrophysics [71,72], geoscience [73], and industrial informatics [74][75][76]. The algorithm randomly selects and removes observations from the majority class until it achieves the desired equilibrium between the two classes.…”
Section: Class Unbalancementioning
confidence: 99%
“…In summary, these results are encouraging and consist of a promising research direction towards a more efficient identification of the stability of slopes, particularly for minority classes. Thus, as a future development, we intend to explore other feature selection approaches as well as different strategies to handle imbalanced data (e.g., the usage of Tomek links or Gaussian copula transformations [72]). In addition, considering the high number of data available for each of the three slope types, a deep learning approach may provide an important contribution toward the prediction models' efficiency.…”
Section: Final Remarksmentioning
confidence: 99%