2020
DOI: 10.3390/electronics9071070
|View full text |Cite
|
Sign up to set email alerts
|

Effective DGA-Domain Detection and Classification with TextCNN and Additional Features

Abstract: Malicious codes, such as advanced persistent threat (APT) attacks, do not operate immediately after infecting the system, but after receiving commands from the attacker’s command and control (C&C) server. The system infected by the malicious code tries to communicate with the C&C server through the IP address or domain address of the C&C server. If the IP address or domain address is hard-coded inside the malicious code, it can analyze the malicious code to obtain the address and block acce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 18 publications
0
2
0
1
Order By: Relevance
“…TextCNN 52 is a text classification model based on CNN proposed by Yoon Kim in 2014. Due to its extraordinary ability in extracting text-related regions and features from image components, 53 this model has been widely applied in many different research fields, such as feature extraction, 54 , 55 classification, 56 program detection, 57 and etc. The TextCNN model performs convolutional operations on the word vectors by three convolutional kernels in the convolutional layer to generate feature vectors, performs maximum pooling of the convolved feature vectors in the pooling layer, and finally outputs the features in the fully connected layer.…”
Section: Methodsmentioning
confidence: 99%
“…TextCNN 52 is a text classification model based on CNN proposed by Yoon Kim in 2014. Due to its extraordinary ability in extracting text-related regions and features from image components, 53 this model has been widely applied in many different research fields, such as feature extraction, 54 , 55 classification, 56 program detection, 57 and etc. The TextCNN model performs convolutional operations on the word vectors by three convolutional kernels in the convolutional layer to generate feature vectors, performs maximum pooling of the convolved feature vectors in the pooling layer, and finally outputs the features in the fully connected layer.…”
Section: Methodsmentioning
confidence: 99%
“…Hwang et al [27] used 10 context-free features and in addition they extracted 100 features using a TextCNN. The TextCNN takes as input a 70 × 100 matrix for each domain name, constructed by taking 100 characters from the domain name (using truncation for longer domain names and padding for shorter domain names) and one-hot encoding with a dictionary of 70 characters.…”
Section: Context-free Featuresmentioning
confidence: 99%
“…However, they create a whole detection chain by not only detecting the domains through semantic similarity, but also embedding the domains in case the semantic similarity did not trigger the alarms. On the same line, (Hwang, 2020) presents a method to detect and classify DGA by extracting features and passing them to a CNN-based model that labels the domains as DGA or legit. More deep learning based DGA detection research works can be found, such as (Tuan, 2022) where LSTM based techniques are used, or (Aravamudu, 2022) where various ML classifiers are tested against this task.…”
Section: Related Workmentioning
confidence: 99%