Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security 2017
DOI: 10.1145/3133956.3134018
|View full text |Cite
|
Sign up to set email alerts
|

Neural Network-based Graph Embedding for Cross-Platform Binary Code Similarity Detection

Abstract: The problem of cross-platform binary code similarity detection aims at detecting whether two binary functions coming from different platforms are similar or not. It has many security applications, including plagiarism detection, malware detection, vulnerability search, etc. Existing approaches rely on approximate graphmatching algorithms, which are inevitably slow and sometimes inaccurate, and hard to adapt to a new task. To address these issues, in this work, we propose a novel neural network-based approach t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
536
0
2

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 487 publications
(572 citation statements)
references
References 32 publications
0
536
0
2
Order By: Relevance
“…Machine learning based approaches. Machine learning, including deep learning, has been applied to code analysis [10], [38], [21], [62], [50], [43], [51], [29], [59], [28], [52], [27], [17]. Lee et al propose Instruction2vec for converting assembly instructions to vector representations [38]; but their instruction embedding model can only work on a single architecture.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Machine learning based approaches. Machine learning, including deep learning, has been applied to code analysis [10], [38], [21], [62], [50], [43], [51], [29], [59], [28], [52], [27], [17]. Lee et al propose Instruction2vec for converting assembly instructions to vector representations [38]; but their instruction embedding model can only work on a single architecture.…”
Section: Related Workmentioning
confidence: 99%
“…A few works target cross-architecture binary code analysis [21], [62], [8], [67]. Some exploit the statistical aspects of code, rather than its semantics.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Neural Networks with Structure2vec. In [106], a deep graph embedding approach is proposed for cross-platform binary code similarity detection. A Siamese architecture is applied to enable the pair-wise similarity learning, and the graph embedding network based on Structure2vec [27] is used for learning graph representations in the twin networks, which share weights with each other.…”
Section: Graph-level Embedding Based Methodsmentioning
confidence: 99%