2022
DOI: 10.1109/access.2022.3205337
|View full text |Cite
|
Sign up to set email alerts
|

Effect of Balancing Data Using Synthetic Data on the Performance of Machine Learning Classifiers for Intrusion Detection in Computer Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 35 publications
(11 citation statements)
references
References 62 publications
0
11
0
Order By: Relevance
“…The training and test sets for the ANNs are then partially balanced to limit overlearning of a smaller number of output classes [25]. Partial balancing is performed by calculating the median of the number of real-world destinations for each type of surgery, with specific destination quantities above the median limited to the median number, and destination quantities below the median utilizing all available cases, for each of the 9 surgical specialties.…”
Section: Methodsmentioning
confidence: 99%
“…The training and test sets for the ANNs are then partially balanced to limit overlearning of a smaller number of output classes [25]. Partial balancing is performed by calculating the median of the number of real-world destinations for each type of surgery, with specific destination quantities above the median limited to the median number, and destination quantities below the median utilizing all available cases, for each of the 9 surgical specialties.…”
Section: Methodsmentioning
confidence: 99%
“…Chui et al [29] proposed a three-stage data generation algorithm that combines synthetic minority oversampling technique with generative adversarial networks (GANs) and variational autoencoder to produce high-quality data, aiming to address the issue of generating additional training samples for minority class in adversarial attacks. In addition to utilizing random oversampling, Dina et al [30] employed a data synthesis approach called a conditional generative adversarial network (CTGAN) to balance the data and improve the accuracy of various machine learning classifiers on the NSL-KDD and UNSW-NB15 intrusion detection datasets. Furthermore, Gaggero et al [31] proposed an anomaly detection algorithm based on neural network autoencoders by analyzing the typical architecture of the storage system in microgrids and potential vulnerabilities therein.…”
Section: Related Workmentioning
confidence: 99%
“…[39], [40], [41], [42], [43] A. Generative Models 1) TimeGAN: TimeGAN, a generative model designed for time series data, leverages a Generative Adversarial Network (GAN) framework to generate synthetic time series data that closely resembles the original data's statistical properties and dependencies [44], [45]. It comprises two main components: the generator and the discriminator.…”
Section: Data Expansion Techniquesmentioning
confidence: 99%