2022
DOI: 10.48550/arxiv.2205.08944
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SoK: The Impact of Unlabelled Data in Cyberthreat Detection

Giovanni Apruzzese,
Pavel Laskov,
Aliya Tastemirova

Abstract: Machine learning (ML) has become an important paradigm for cyberthreat detection (CTD) in the recent years. A substantial research effort has been invested in the development of specialized algorithms for CTD tasks. From the operational perspective, however, the progress of MLbased CTD is hindered by the difficulty in obtaining the large sets of labelled data to train ML detectors. A potential solution to this problem are semisupervised learning (SsL) methods, which combine small labelled datasets with large a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 85 publications
0
1
0
Order By: Relevance
“…Apruzzese et al [2] proposed semisupervised methods within the framework of active learning. This is very beneficial to improve the current dataset but it cannot be used to evaluate quality.…”
Section: Related Workmentioning
confidence: 99%
“…Apruzzese et al [2] proposed semisupervised methods within the framework of active learning. This is very beneficial to improve the current dataset but it cannot be used to evaluate quality.…”
Section: Related Workmentioning
confidence: 99%