Proceedings of the 38th Annual Computer Security Applications Conference 2022
DOI: 10.1145/3564625.3567982
|View full text |Cite
|
Sign up to set email alerts
|

DitDetector: Bimodal Learning based on Deceptive Image and Text for Macro Malware Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…The integration of perceptual hashing and OCR for the detection of deceptive visual elements and textual content, respectively, alongside a novel application of the visual encoder MobileNetV3 and text encoder TextCNN, culminates in a binary classifier. DitDetector excels in mapping images and deceptive textual captures into a shared space, thereby significantly enhancing the accuracy of malicious macro detection through the adept utilization of P-code analysis [30].…”
Section: ) Machine Learning Based Approachesmentioning
confidence: 99%
“…The integration of perceptual hashing and OCR for the detection of deceptive visual elements and textual content, respectively, alongside a novel application of the visual encoder MobileNetV3 and text encoder TextCNN, culminates in a binary classifier. DitDetector excels in mapping images and deceptive textual captures into a shared space, thereby significantly enhancing the accuracy of malicious macro detection through the adept utilization of P-code analysis [30].…”
Section: ) Machine Learning Based Approachesmentioning
confidence: 99%
“…Their detection models were trained and tested using a very large database of malicious and benign MSOffice documents (1.8 million files), collected over a long period of time . Yan et al propose DitDetector for macro malware detection, which leverages bimodal learning based on deceptive images and text [23]. They extracted preview images of documents based on an image export SDK of Oracle and textual information from preview images based on an open-source OCR engine.…”
Section: B Macro Malware Detectionmentioning
confidence: 99%
“…1. Classification models take token representations and train supervised models to predict texts [124,219,220].…”
Section: Data-driven Modelsmentioning
confidence: 99%
“…Classification has been used to determine whether a text is relevant to analyze [168,220], or, once it is determined to be relevant, classified into different TTP categories [88,124,185,219,225]. Hybrid approaches [88,185,225] classify individual tokens or parts of sentences, whereas purely data-driven approaches classify paragraphs or entire documents [124,219,220]. While requiring less a priori knowledge of CTI texts, data-driven classification approaches rely on vast labeled datasets, limiting their application.…”
Section: Classification Modelsmentioning
confidence: 99%