Collecting and visualizing data lineage of Spark jobs

Schoenenwald, Alexander; Kern, Simon; Viehhauser, Josef; Schildgen, Johannes

doi:10.1007/s13222-021-00387-7

Cited by 1 publication

(1 citation statement)

References 15 publications


“…Collecting data is the stage of data that has been collected. The data used is sourced from kaggle.com, which is opensource (Schoenenwald et al, 2021).…”

Section: Collecting Data (mentioning)

confidence: 99%

Global Network Cyberattack Classification Using Naive Bayes Method Time Range 2020 – 2023

Sandi Mutia, Irawan, Juliane

2024

Astonjadro


This study develops a classification model for cyberattacks on global networks from 2020 to 2023 using the Naive Bayes method. The main objective is to analyze and classify the frequency and severity of cyberattacks, which helps improve network security and reduce vulnerabilities. The Naive Bayes method was chosen for its efficiency in handling large datasets and its ability to make predictions based on probabilities. Drawing cyberattack data from a variety of reliable and up-to-date sources, the study covers attacks such as ransomware, phishing, DDoS, and other malware. The classification process includes data pre-processing, feature extraction, and finally the application of the Naive Bayes algorithm to identify patterns in such attacks. The classification results are then evaluated using the Apply Model and Performance validation methods to assess the effectiveness of the model. The results show that Naive Bayes is able to accurately classify cyberattacks, providing a useful tool for cybersecurity professionals to understand attack trends and respond proactively. The study also suggests areas for further research, including the integration of the Naive Bayes model with other artificial intelligence systems for improved cyberattack detection. The study provides new insights into the application of the Naive Bayes method in cybersecurity and paves the way for improved data-driven cyber defense strategies.

