2019
DOI: 10.1109/access.2019.2911522
|View full text |Cite
|
Sign up to set email alerts
|

Weakly Supervised Deep Learning for the Detection of Domain Generation Algorithms

Abstract: Domain generation algorithms (DGAs) have become commonplace in malware that seeks to establish command and control communication between an infected machine and the botmaster. DGAs dynamically and consistently generate large volumes of malicious domain names, only a few of which are registered by the botmaster, within a short time window around their generation time, and subsequently resolved when the malware on the infected machine tries to access them. Deep neural networks that can classify domain names as b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
19
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 16 publications
1
19
0
Order By: Relevance
“…In [18], an iterative semi-supervised random forest classifier was constructed to separate dedicated and public IP addresses. In [48], the authors proposed a deep neural networks to classify domain names as benign or malicious, as part of DGAs. These methods, however, have not considered the problem of identifying new malicious domains related to an observed campaign, as considered in this paper.…”
Section: ) Algorithmic Methodsmentioning
confidence: 99%
“…In [18], an iterative semi-supervised random forest classifier was constructed to separate dedicated and public IP addresses. In [48], the authors proposed a deep neural networks to classify domain names as benign or malicious, as part of DGAs. These methods, however, have not considered the problem of identifying new malicious domains related to an observed campaign, as considered in this paper.…”
Section: ) Algorithmic Methodsmentioning
confidence: 99%
“…However, this model still cannot effectively distinguish between word-based algorithmically generated domain names and legitimate ones. They also studied the problem of how to supply sufficient labeled training data for deep learning-based DGA classifiers [ 25 ]. Zeng et al.…”
Section: Related Workmentioning
confidence: 99%
“…G ij E ij y j |θ i (11) Then in the M-step, we define the expectations of log S(y, γ |θ) as M function as following:…”
Section: ) Probability Fusion Modulementioning
confidence: 99%
“…Similarly, we also update the gating parameterĜ i under the constraint 3 i=1 G ij = 1. The overall process of two-step loops from Equation (11) to (14) is terminated if any parameters move will no longer lead to further minimization of the M function, leading to the final estimation of labeling result. Therefore, the next iteration parameters are rewritten as Equation (14).…”
Section: ) Probability Fusion Modulementioning
confidence: 99%
See 1 more Smart Citation