The significant rise in the frequency and sophistication of cyber-attacks and their diversity necessitated various researchers to develop strong and effective approaches to address recurring cyber threat challenges. This study evaluated the performance of three selected meta-learning models for optimal multi-class detection of cyber-attacks using the University of New South Wales 2015 Network benchmark (UNSW-NB15) Intrusion Dataset. The results of this study show and confirm the ability of the three base models; Naive Bayes, C4.5 Decision Tree, and K-Nearest Neighbor for solving multi-class problems. It further affirms the knack of the duo of feature selection techniques and stacked ensemble learning to optimize ML models' performances. The stacking of the predictions of the information gain base models with Model Decision Tree meta-algorithm recorded the most improved and optimal cyber-attacks detection accuracy and Mattew's correlation Coefficient than the stacking with the Multiple Model Trees (MMT) and Multi Response Linear regression (MLR) Meta algorithms.
Lower Respiratory Tract Infections (LRTIs) are the second and third causes of pediatric patients' death in Nigeria and the United States of America. It is observed from several reviewed literature that the LRTIs accounted for more than a million children morbidity and mortality yearly due to lack of prompt diagnosis or no diagnosis due to a shortage of medical experts and medical facilities in our localities. Intense research is ongoing on applying machine learning (ML) to its clinical diagnosis and reducing its spread in pediatric patients. In this research, K-Nearest Neighbor (KNN), C4.5 Decision Tree, and Naive Bayes' ML algorithms were used to develop three base diagnosis models with Correlation, consistency, and information gain selected feature of the LRTI dataset, Multiple Model Trees (MMT) Meta algorithm is used to combine and improve the diagnoses of all the base models using stacked ensemble. The preliminary diagnosis findings using base models have established that the information gained feature extraction method performed much better than the other two. It, therefore, suffix that the results from this should be used for further processing. All the models built with the reduced feature set recorded improved diagnoses accuracy more than the model built with the whole feature set. The MMT stacked ensemble models recorded an improvement on the diagnosis of LRTIs in Peadiatric, it recorded the highest diagnostic accuracies improvement of 12.80%, 13.52%, and 12.37%, and lowest diagnostic accuracies improvement of 6.37%, 5.22%, and 6.09% with the MMT stacked ensemble models of the Consistency, the Correlation, and the information gain reduced selected feature set respectively. These experimental results show the potential for this approach to deliver a reliable and improved diagnosis of LRTIs. It is recommended to be used to diagnose LRTIs in primary health care centers to reduce its mortality rate.
Phishing;, an identity theft of sensitive information poses a serious challenge to security of personal information, it has worrisome effect on countless number of internet users bringing about a huge financial demand on business and victims alike. Text mining is a branch of Data mining used in analyzing large volume of unstructured text data in order to extract meaningful information from it, Machine learning (ML) is an aspect of artificial Intelligence (AI) that uses the method of data mining to find out new or existing characteristics from a set of gathered data which can be relevant for classification. Machine learning methods has been found to achieve much better result than other phished email detection techniques such as blacklists, visual similarity and heuristic techniques. In this work, text mining of phished and ham emails were carried out, three machine learning techniques:-Naive Bayes, K-Nearest Neighbor and Support Vector Machine were used in identifying phished email on a standard analyzed phished email and Ham corpora. From the result, Naive bayes was found to have highest classification accuracy of 99.0% as against the other two machine learning techniques SVM (98.6%) and KNN (96.9%).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.