2020
DOI: 10.1088/1742-6596/1684/1/012041
|View full text |Cite
|
Sign up to set email alerts
|

The Application of LightGBM in Microsoft Malware Detection

Abstract: The development of new technologies has caused computers one of the most popular electronic products. However, there is always a number of people who intend to take advantages of others through attacking others’ computers. To avoid property damage as much as possible, a precise and efficient detection is essential. This work uses the dataset which was generated by combining heartbeat and threat reports collected by Microsoft’ s endpoint protection solution to find out an effective solution. Since the dataset i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 6 publications
0
7
0
Order By: Relevance
“…( 27 ), which introduces regularization and reduces the complexity of the Tree. LightGBM ( 28 ) is an improved GBDT framework model, which uses histogram segmentation algorithm to replace the traditional pre-sorting traversal algorithm, with faster parallel training speed and higher accuracy, and can effectively prevent over-fitting. ExtraTrees is an integrated learning algorithm, which contains many decision trees and the classification result is determined by the vote of many decision trees.…”
Section: Methodsmentioning
confidence: 99%
“…( 27 ), which introduces regularization and reduces the complexity of the Tree. LightGBM ( 28 ) is an improved GBDT framework model, which uses histogram segmentation algorithm to replace the traditional pre-sorting traversal algorithm, with faster parallel training speed and higher accuracy, and can effectively prevent over-fitting. ExtraTrees is an integrated learning algorithm, which contains many decision trees and the classification result is determined by the vote of many decision trees.…”
Section: Methodsmentioning
confidence: 99%
“…The aim is to improve the efficiency and effectiveness of Windows malware detection. Our preliminary study revealed that the LightGBM technique which is the best of the GBDT algorithm, has proven to be suitable for Windows malware detection (Abbadi et al, 2020;Pan et al, 2020) and can be improved for effective and efficient malware detection. ML-based classifiers use underlying features to distinguish between malicious and benign applications, and detecting changes in those features when malicious modifies itself.…”
Section: Anomaly-based Detectionmentioning
confidence: 95%
“…The detection time of the model was not considered. Pan et al (2020) used Logistic Regression, KNN and LightGBM to build models based on datasets of heartbeat and threat reports. The results obtained from the respective models show that LightGBM has the highest accuracy with AUC of 0.720687.…”
Section: Anomaly-based Detectionmentioning
confidence: 99%
“…To evaluate their work, most prior work used the area under the receiving operator characteristic curve (ROC), which is hereby referred to as the AUC score. Pan et al (2020) first preprocessed the aforementioned dataset to reduce the memory occupied by the dataset. This was done via the removal of columns that contained a > 95% proportion of null samples, switching several data types to less precise forms and converting several ordinal fields into nominal fields.…”
Section: Related Workmentioning
confidence: 99%