Growing consumerism has made online reviews increasingly important. Many consumers take the opinions voiced in these reviews into account when making financial decisions online. This has given rise to opinion spamming, whether for profit or for other motives. Prior work has addressed the identification of such spammers, but the scale of real-world review systems demands that the problem be treated as a Big Data challenge. This work therefore applies Big Data principles to the detection of online review spammers. A rating-based model is studied on large-scale datasets (more than 80 million reviews by 20 million reviewers) using the Hadoop and Spark frameworks. Scale effects are identified and mitigated to provide better context for large review systems. An improved computational framework is presented that computes the overall spamcity of reviewers using exponential smoothing, with the value of the smoothing factor set empirically. Finally, future directions are discussed.
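The abstract does not state which quantities are smoothed or what value the smoothing factor takes. As a minimal sketch of the general technique only, the following assumes that each reviewer has an ordered sequence of per-review spam scores in [0, 1] and combines them with standard simple exponential smoothing, s_t = alpha * x_t + (1 - alpha) * s_{t-1}. The function name `smoothed_spamcity`, the input layout, and the example value of `alpha` are hypothetical and are not taken from the work itself.

```python
from typing import Sequence


def smoothed_spamcity(scores: Sequence[float], alpha: float) -> float:
    """Combine an ordered sequence of scores with simple exponential smoothing.

    Assumption: `scores` holds one reviewer's per-review spam scores in
    chronological order. `alpha` is the smoothing factor in [0, 1]; larger
    values weight recent scores more heavily.
    """
    if not scores:
        raise ValueError("scores must contain at least one value")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")

    # Initialise with the first observation, then fold in the rest.
    smoothed = scores[0]
    for score in scores[1:]:
        smoothed = alpha * score + (1.0 - alpha) * smoothed
    return smoothed


if __name__ == "__main__":
    # Illustrative values only; alpha = 0.5 is not the empirically chosen factor.
    print(smoothed_spamcity([0.2, 0.4, 0.8], alpha=0.5))
```

With `alpha = 1` the result is simply the most recent score, and with `alpha = 0` it is the first score, which is why the factor has to be tuned empirically for a given dataset.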