The availability of social media, blogs, and websites to everyone creates a lot of problems. False news is a critical issue that can affect individuals or entire countries. Fake news can be created and shared all over the world. The 2016 presidential election in the United States illustrates that problem. As a result, controlling social media is essential. Machine learning (ML) algorithms help to detect fake news automatically. This article proposes a framework for detecting fake news based on feature extraction and feature selection algorithms and a set of voting classifiers. The proposed system distinguishes fake news from real news. First, we preprocessed the data taking unnecessary characters and numbers and reducing the words in the dictionary (lemmatization). Second, we extracted some important features using two feature extraction types: the term frequency-inverse document frequency (TF-IDF) technique and the DOC2VEC algorithm, a word embedding technique. Third, the extracted characteristics were reduced with the help of the chi-square algorithm and the analysis of variance (ANOVA) algorithm. We used three data sets that are published online: Media-Eval, Fake-or-Real-News, and ISOT. To evaluate the proposed framework, we used five different performance metrics: accuracy (ACC), the area under the curve (AUC), precision, recall, and f1-score. Our system achieved 94.6% of ACC for the Fake-or-Real dataset. For the Media-Eval dataset, the system achieved 92.3% of ACC. For the ISOT dataset, the system achieved 100% of ACC. We contrast the proposed framework with several other classification algorithms. The experimental results show that the proposed framework outperforms the existing works in terms of ACC by 0.2% for the ISOT dataset.
INDEX TERMSFake news, News classification, Voting classifier, Term frequency-inverse document frequency, Chi-square Eman Elsaeed received the B.Sc. degree in information technology department from the Faculty of Computers and Information, Mansoura University, Egypt, in 2014. Currently, she is an M.Sc. student at the Faculty of Computers and Information, Mansoura University, Egypt. Her research interests include Artificial intelligence, machine learning, and data mining.