Proposed for Presentation at the 13th ACM Workshop on Artificial Intelligence and Security Held November 13, 2020. 2020
DOI: 10.2172/1831009
|View full text |Cite
|
Sign up to set email alerts
|

Mind the Gap: On Bridging the Semantic Gap Between Machine Learning and Malware Analysis.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…For training, we consider a subset of the 600K-sample Ember training dataset, as well as the full Brazilian dataset; we train a separate MalConv model for each case. Our subsets consist of 10-30K samples, which is consistent with the dataset sizes that many malware research groups currently work with [44]. While we would have preferred to experiment with augmenting more datasets, we are constrained since many existing datasets do not contain raw binaries as mentioned in 2.…”
Section: Discussionmentioning
confidence: 99%
“…For training, we consider a subset of the 600K-sample Ember training dataset, as well as the full Brazilian dataset; we train a separate MalConv model for each case. Our subsets consist of 10-30K samples, which is consistent with the dataset sizes that many malware research groups currently work with [44]. While we would have preferred to experiment with augmenting more datasets, we are constrained since many existing datasets do not contain raw binaries as mentioned in 2.…”
Section: Discussionmentioning
confidence: 99%
“…Several state-of-the-art studies [10], [11], [38]- [41] using ML/DL and genetic algorithm achieve performance gains in detecting and analyzing malicious executables and mobile apps. Additionally, they have shown that automated behavioral profiling or semantic labeling for malware analysis can benefit security analysts and reduce manual work.…”
Section: B Powershell Behavioral Profilingmentioning
confidence: 99%
“…Smith et al [115] have pointed towards the semantic gap between the machine learning and malware analysis communities. One of their proposals is to reposition the task from identifying malware to identifying behavior, making it possible to understand what a malware is doing.…”
Section: Malware Analysismentioning
confidence: 99%