SoK: The Impact of Unlabelled Data in Cyberthreat Detection

Apruzzese, Giovanni; Laskov, Pavel; Tastemirova, Aliya

doi:10.48550/arxiv.2205.08944

Search citation statements

Order By: Relevance

Paper Sections

Select...

Related Work1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2023

Publication Types

Select...

Book1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 85 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Apruzzese et al [2] proposed semisupervised methods within the framework of active learning. This is very beneficial to improve the current dataset but it cannot be used to evaluate quality.…”

Section: Related Workmentioning

confidence: 99%

Evaluation of the Limit of Detection in Network Dataset Quality Assessment with PerQoDA

Wasielewska

Soukup

Čejka

et al. 2023

Communications in Computer and Information Science

View full text Add to dashboard Cite

Machine learning is recognised as a relevant approach to detect attacks and other anomalies in network traffic. However, there are still no suitable network datasets that would enable effective detection. On the other hand, the preparation of a network dataset is not easy due to privacy reasons but also due to the lack of tools for assessing their quality. In a previous paper, we proposed a new method for data quality assessment based on permutation testing. This paper presents a parallel study on the limits of detection of such an approach. We focus on the problem of network flow classification and use well-known machine learning techniques. The experiments were performed using publicly available network datasets.

show abstract

Section: Related Workmentioning

confidence: 99%