This paper introduces Big data security analysis with the help of different techniques used in network intrusion detection system. The topic of how big data affects any intrusion detection system being used and how huge volume of the dataset, its specialized features that are heterogeneous in nature and what will happen if big data is processed at real time. Different attacks and intrusion detection methods such as intrusion detection and prevention systems (IDPS), signature-based detection (SD) and anomaly-based detection (AD) has been done. Challenges faced by intrusion detection systems (IDS), how they can be prevented and how machine learning, data mining techniques could be used in any general intrusion detection-based system has also been discussed. Also, how all the problem faced by IDPS can be solved by network simulator named NS-3.0. Its objectives, advantages, comparison with other networks and limitation have also been to be discussed. The recommendation is also given to improve faults. Also, results obtained after using NS-3 based svm classifier using KDD Cup 99 Dataset showed the accuracy of 99 percent.
In today's data-intensive world, millions of data is generated, processed and transferred, The main factors for the generation of data is an increase in the usage of social media and so is the increase in data mining methodologies. Text Classification is one of the most important aspects of data mining which includes fetching of data, pre-processing it and then applying classifiers to divide the data into the categories so as it would be easy to process and subject to further experimentation. In this paper, data is subjected through certain feature selection techniques enhancing its parameters and then applied multiple Machine Learning classifiers on it so as to study various parameters of the data which include accuracy, precision and various averages. The impact of increasing or decreasing the categories for classification of text on accuracy through various classifiers is studied which include Naive Bayes, Support Vector Machine and K-Nearest Neighbour and also the combination of individual classifiers in an ensemble classifier. In this research the internal parameters of Feature Selection Techniques and classifiers are also changed which lead to a slightest increase in overall accuracy of the classifier. Reducing different categories also increases accuracy to a greater extent because it also reduces the presence of multiple similar categories which lead to decrease in overall accuracy. Certain changes in the feature selection parameters are also included which is trying algorithms on uni-gram, bi-gram and tri-gram models and out of which bi-gram shows the best overall accuracy result with Support Vector Machine classifier.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.