The scale of data streaming in social networks, such as Twitter, is increasing exponentially. Twitter is one of the most important and suitable big data sources for machine learning research in terms of analysis, prediction, extract knowledge, and opinions. People use Twitter platform daily to express their opinion which is a fundamental fact that influence their behaviors. In recent years, the flow of Iraqi dialect has been increased, especially on the Twitter platform. Sentiment analysis for different dialects and opinion mining has become a hot topic in data science researches. In this paper, we will attempt to develop a real-time analytic model for sentiment analysis and opinion mining to Iraqi tweets using spark streaming, also create a dataset for researcher in this field. The Twitter handle Bassam AlRawi is the case study here. The new method is more suitable in the current day machine learning applications and fast online prediction.
On-going big data from social networks sites alike Twitter or Facebook has been an entrancing hotspot for investigation by researchers in current decades as a result of various aspects including up-to-date-ness, accessibility and popularity; however anyway there may be a trade off in accuracy. Moreover, clustering of twitter data has caught the attention of researchers. As such, an algorithm which can cluster data within a lesser computational time, especially for data streaming is needed. The presented adaptive clustering and classification algorithm is used for data streaming in Apache spark to overcome the existing problems is processed in two phases. In the first phase, the input pre-processed twitter data is viably clustered utilizing an Improved Fuzzy C-means clustering and the proposed clustering is additionally improved by an Adaptive Particle swarm optimization (PSO) algorithm. Further the clustered data streaming is assessed utilizing spark engine. In the second phase, the input pre-processed Higgs data is classified utilizing the modified support vector machine (MSVM) classifier with grid search optimization. At long last the optimized information is assessed in spark engine and the assessed esteem is utilized to discover an accomplished confusion matrix. The proposed work is utilizing Twitter dataset and Higgs dataset for the data streaming in Apache Spark. The computational examinations exhibit the superiority of presented approach comparing with the existing methods in terms of precision, recall, F-score, convergence, ROC curve and accuracy.
Membrane Computing (MC) is defined as one of the main areas in computer sciences; MC has the aim of discovering novel computational models from studying biological cells, specifically the cellular membranes. Mitogen-Activated Protein Kinases (MAPK) cascade was the subject of research in the areas of modeling and simulation. Various software tools such as Performance Evaluation Process Algebra (PEPA) have been used to solve the MAPK cascade for the purpose of improving the effectiveness of signaling. In this study, the MAPK cascade is modeled by using MC. The models of Membrane Computing could be totally fully utilized by applying parallel computing platforms. P-Lingua can be defined as a programming language for MeCoSim and MC, where MC simulators are used to model and simulate MAPK. P-Lingua will be applied to structure, develop and examine the implementation of MAPK cascades in membrane computing. MeCoSim supports charts, outputs, and inputs which have been adapted to MC. The simulation results have been put to comparison with PEPA model. The results indicate that MC improves the MAPK implementation compared to PEPA. This study showed that MC, with its biological characteristics, could improve the implementation regarding biological processes including MAPK.
With the increasing use of encryption in network traffic, anomaly detection in encrypted traffic has become a challenging problem. This study proposes an approach for anomaly detection in encrypted HTTPS traffic using machine learning and compares the performance of different feature selection techniques. The proposed approach uses a dataset of HTTPS traffic and applies various machine learning models for anomaly detection. The study evaluates the performance of the models using various evaluation metrics, including accuracy, precision, recall, F1-score, and area under the curve (AUC). The results show that the proposed approach with feature selection outperforms other existing techniques for anomaly detection in encrypted network traffic. However, the proposed approach has limitations, such as the need for further optimization and the use of a single dataset for evaluation. The study provides insights into the performance of different feature selection techniques and presents future research directions for improving the proposed approach. Overall, the proposed approach can aid in the development of more effective anomaly detection techniques in encrypted network traffic.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.