Click fraud is the sort of deception in which traffic figures for online ads are intentionally inflated. For businesses that advertise online, click fraud may occur often, resulting in erroneous click statistics and lost funds. That is why many businesses are hesitant to advertise their products on websites and mobile apps. To market their products safely, businesses need a reliable technique for detecting click fraud. In this paper we present a stacking algorithm as a solution to this problem. The proposed method’s premise is to combine multiple learners to achieve an optimal result. The Synthetic Minority Oversampling Technique (SMOTE) with a combination of undersampling are chosen to handle the unbalanced dataset. In the first-level learners, there are four supervised Machine Learning algorithms, which are AdaBoost, Random Forest, Decision Tree and Logistic Regression. Moreover, Logistic Regression is used again as a the second-level learner. To verify the efficacy of the suggested approach, comparative tests are carried out on the public dataset available on Kaggle from China’s largest independent big data service platform TalkingData. Multiple indicators, such as Accuracy, F1 Score, ROC curve, Loss Log and AUC Score, are utilized to analyze the prediction outcomes. The findings reveal that the stacking method improves forecast accuracy while also maintaining a high level of stability.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.