2019
DOI: 10.3390/app9224764
|View full text |Cite
|
Sign up to set email alerts
|

Malicious PDF Detection Model against Adversarial Attack Built from Benign PDF Containing JavaScript

Abstract: Intelligent attacks using document-based malware that exploit vulnerabilities in document viewing software programs or document file structure are increasing rapidly. There are many cases of using PDF (portable document format) in proportion to its usage. We provide in-depth analysis on PDF structure and JavaScript content embedded in PDFs. Then, we develop the diverse feature set encompassing the structure and metadata such as file size, version, encoding method and keywords, and the content features such as … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 9 publications
0
6
0
Order By: Relevance
“…Using countless ML and DL models, several varieties of research have been managing on the identification of PDF malware. Kang et al described the use of the PDF in 2019 [15]. They gave a thorough analysis of the JavaScript structure and content in the PDF with embedded XML.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Using countless ML and DL models, several varieties of research have been managing on the identification of PDF malware. Kang et al described the use of the PDF in 2019 [15]. They gave a thorough analysis of the JavaScript structure and content in the PDF with embedded XML.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In [26], authors examined PDF design and JavaScript information included in PDFs from top to bottom. With regard to design and metadata, they created an extensive set of capabilities, such as the count of bytes per second, the encoding scheme, object names, catchphrases, and comprehensible strings in JavaScript.…”
Section: File-based Malware Identification Related Workmentioning
confidence: 99%
“…Authors in [39] offer in-depth analyses of PDFs' JavaScript content and structure. Then create a rich feature set in JavaScript that includes content features like object names, keywords, and readable strings, as well as the structure and metadata features like file size, version, encoding method, and keywords.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The PDF files used in this study comprise 9,000 benign and 11,097 malicious document files gathered by the Contagio malware dump between November 2009 and June 2018 [39]. The malware samples are provided via the Contagio malware dump site.…”
Section: Literature Reviewmentioning
confidence: 99%