Windows malware detection based on static analysis with multiple features

Yousuf, Muhammad Irfan; Anwer, Izza; Riasat, Ayesha; Zia, Khawaja Tahir; Kim, Suhyun

doi:10.7717/peerj-cs.1319

Cited by 5 publications

(3 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The researchers used Python programming language under the Jupyter platform for implementing their model codes. In additional, in [58] employed seven machine learning classifiers (Naïve Bayes, SVM, Decision, Random Forest, KNN, Nearest Centroid and Gradient Boost) in classifying 29,797 samples of Portable Executable (PE) malware collected from various sources including files from Windows installation. The best obtained accuracy was with Random Forest with PCA as dimensionality reducer reached 99.41%.…”

Section: Principal Component Analysis Algorithm In Digital Forensicsmentioning

confidence: 99%

“…Issues related to multicollinearity [57] 2022 Analysis and investigate Malware Forensic PCA, K-Means Detecting zero-day attacks using ML approaches and feature engineering. [58] 2023 Malware Forensic PCA Static malware detection framework by mining DLLs, and API calls from each DLL using ML approach and feature selecting.…”

Section: Networking Forensicmentioning

confidence: 99%

“…[52] SVM Imbalanced Datasets SVM has difficulties dealing with the imbalanced data set that occasionally characterizes digital forensic data. [58] PCA Assumption of Linearity…”

Section: Nb Assumption Of Independencementioning

confidence: 99%

See 2 more Smart Citations

The current landscape of digital forensics employing machine learning approaches: A Review

Talabani

2024

Int. J. Comput. Artif. Intell.

View full text Add to dashboard Cite

This paper discusses and evaluates the present state of digital forensics, as well as how machine learning techniques are used in this field. The paper covers technological advances in forensics medicine and how we may gain from the performance of machine learning algorithms to compare their performances for improvement on data collection, analysis and investigation. The focus is on the benefits and challenges that may arise while adopting algorithms: Naive Bayes (NB), K-Nearest Neighbor (K-NN), Support Vector Machine (SVM), Principal Component Analysis (PCA) and Kmeans. Apart from analyzing the latest research and studies in this subject area. Furthermore, tracing new trends in the digital forensics' domain and outline ways that machine learning can be used for better performance.

show abstract

Section: Principal Component Analysis Algorithm In Digital Forensicsmentioning

confidence: 99%

Section: Networking Forensicmentioning

confidence: 99%

See 1 more Smart Citation

The current landscape of digital forensics employing machine learning approaches: A Review

Talabani

2024

Int. J. Comput. Artif. Intell.

View full text Add to dashboard Cite

show abstract

A comprehensive analysis combining structural features for detection of new ransomware families

Moreira,

Sales

2024

Journal of Information Security and Applications

View full text Add to dashboard Cite

A Deep Reinforcement Learning Framework to Evade Black-Box Machine Learning Based IoT Malware Detectors Using GAN-Generated Influential Features

Arif,

Aslam,

Al-Otaibi

et al. 2023

IEEE Access

View full text Add to dashboard Cite

In the internet of things (IoT) networks, machine learning (ML) is significantly used for malware and adversary detection. Recently, research has shown that adversarial attacks have put ML-based models at risk. This problem is exacerbated in an IoT environment because of the absence of adequate security measures. Consequently, it is crucial to evaluate the strength of such malware detectors using powerful adversarial samples. The existing adversarial sample generation strategies either rely on high-level image features or an unfiltered feature set, making it challenging to determine which feature modifications are crucial in evading malware detection systems, without compromising the malware functionality. This encourages us to propose an evasion framework named IF-MalEvade, based on Generative Adversarial Network (GAN) and Deep Reinforcement Learning (DRL) that effectively generates fully-working, malware samples with several effective perturbations such as header section manipulation and benign bytes insertion. The DRL framework selects a few suitable action sequences to change malicious samples, thus allowing our malware samples to bypass various black-box ML based malware detectors and the detection search engines of VirusTotal, while maintaining the executability and malicious behavior of the original malware samples. The neural networks of GAN take in the unfiltered feature set of malware dataset and using minimax objective function yields a set of useful features that are subsequently used by the DRL agent to make effective changes. Experimental results illustrated that by utilizing the influential features in sequence of transformations, the adversarial samples generated by our model outperformed the state-of-the-art evasion models with an impressive evasion rate. Additionally, the detection rate of well-known machine learning models was also brought down to upto 97%. Furthermore, when the machine learning models were retrained using adversarial samples, a 35% increase in detection accuracy was observed.

show abstract

Windows malware detection based on static analysis with multiple features

Cited by 5 publications

References 27 publications

The current landscape of digital forensics employing machine learning approaches: A Review

The current landscape of digital forensics employing machine learning approaches: A Review

A comprehensive analysis combining structural features for detection of new ransomware families

A Deep Reinforcement Learning Framework to Evade Black-Box Machine Learning Based IoT Malware Detectors Using GAN-Generated Influential Features

Contact Info

Product

Resources

About