Proceedings of the 34th Annual Computer Security Applications Conference 2018
DOI: 10.1145/3274694.3274700
|View full text |Cite
|
Sign up to set email alerts
|

Accurate Malware Detection by Extreme Abstraction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…We showed that machine-learning-based anti-malware engines on VirusTotal produce a substantial number of false positives on packed binaries, which can be due to the limitations discussed in this work. This is especially a serious issue for machine-learning-based approaches that frequently rely on labels from VirusTotal [22,86,88,97], causing an endless loop in which new approaches rely on polluted datasets, and, in turn, generate polluted datasets for future work.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…We showed that machine-learning-based anti-malware engines on VirusTotal produce a substantial number of false positives on packed binaries, which can be due to the limitations discussed in this work. This is especially a serious issue for machine-learning-based approaches that frequently rely on labels from VirusTotal [22,86,88,97], causing an endless loop in which new approaches rely on polluted datasets, and, in turn, generate polluted datasets for future work.…”
Section: Discussionmentioning
confidence: 99%
“…As these numbers show, any approach that fails to consider packed benign samples when designing and evaluating a malware detection approach ultimately results in a substantial number of false positives on real-world data. This is especially a concern for machine-learning-based approaches, which, in the absence of reliable and fresh ground truth, frequently rely on labels from anti-malware products available on VirusTotal [22,43,86,88,97]. Given the disagreement of anti-malware products in labeling samples [42,48,65,99], a common practice is to sanitize a dataset, for example, by considering decisions from a selected set of anti-malware products, or, as another example, by using a voting-based consensus.…”
Section: Motivationmentioning
confidence: 99%
See 3 more Smart Citations