2021
DOI: 10.1186/s40537-021-00460-8
|View full text |Cite
|
Sign up to set email alerts
|

Detecting web attacks using random undersampling and ensemble learners

Abstract: Class imbalance is an important consideration for cybersecurity and machine learning. We explore classification performance in detecting web attacks in the recent CSE-CIC-IDS2018 dataset. This study considers a total of eight random undersampling (RUS) ratios: no sampling, 999:1, 99:1, 95:5, 9:1, 3:1, 65:35, and 1:1. Additionally, seven different classifiers are employed: Decision Tree (DT), Random Forest (RF), CatBoost (CB), LightGBM (LGB), XGBoost (XGB), Naive Bayes (NB), and Logistic Regression (LR). For cl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(26 citation statements)
references
References 36 publications
0
21
0
1
Order By: Relevance
“…(iv) Random undersampling (RUS): this sampling method removes instances from the majority class to improve class imbalances toward the desired target classes. In [26,27], RUS is more successful than other sampling methods. Additionally, RUS has been used in other studies [28,29] to address the issue of class imbalance.…”
Section: Data Preprocessingmentioning
confidence: 99%
“…(iv) Random undersampling (RUS): this sampling method removes instances from the majority class to improve class imbalances toward the desired target classes. In [26,27], RUS is more successful than other sampling methods. Additionally, RUS has been used in other studies [28,29] to address the issue of class imbalance.…”
Section: Data Preprocessingmentioning
confidence: 99%
“…Class Imbalance is another critical attribute to be considered for building a novel and efficient architecture. Zuech et al (22) analyzed web attacks using random undersampling ratios under various ensemble learning algorithms and discussed Most of the research on Intrusion detection mechanisms was based on the ensemble learning approach. Fitni et al (24) made comparisons with seven single classifiers to identify the most appropriate basic classifiers for ensemble learning; they compared the accuracy metrics of tested architectures and made a cumulative study on it.…”
Section: Related Workmentioning
confidence: 99%
“…Class Imbalance is another critical attribute to be considered for building a novel and efficient architecture. Zuech et al ( 22 ) analyzed web attacks using random undersampling ratios under various ensemble learning algorithms and discussed class balance's significant importance. They observed that undersampling at different ratios could have a drastic effect on the model's performance and achieved an accuracy of about 94.01% accuracy on CIC IDS 2018 dataset using RCNN.…”
Section: Related Workmentioning
confidence: 99%
“…It is well-suited for dealing with categorical features and can also handle missing values [29]. Furthermore, it is insensitive to the order of categorical features, which makes it robust to potential data leakage [30].…”
Section: Catboostmentioning
confidence: 99%