2014 IEEE International Conference on Data Mining 2014
DOI: 10.1109/icdm.2014.47
|View full text |Cite
|
Sign up to set email alerts
|

Spotting Fake Reviews via Collective Positive-Unlabeled Learning

Abstract: Abstract-Online reviews have become an increasingly important resource for decision making and product designing. But reviews systems are often targeted by opinion spamming. Although fake review detection has been studied by researchers for years using supervised learning, ground truth of large scale datasets is still unavailable and most of existing approaches of supervised learning are based on pseudo fake reviews rather than real fake reviews. Working with Dianping 1 , the largest Chinese review hosting sit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
93
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 138 publications
(94 citation statements)
references
References 7 publications
1
93
0
Order By: Relevance
“…Li proposed a collective classification algorithm named MHCC (Multi-typed Heterogeneous Collective Classification) [6]. The advantage of MHCC is that it can transfer the user suspicious probability to their comments, which is the intermediate result MHCC algorithm.…”
Section: The Spread Of Suspicious Probabilitymentioning
confidence: 99%
See 2 more Smart Citations
“…Li proposed a collective classification algorithm named MHCC (Multi-typed Heterogeneous Collective Classification) [6]. The advantage of MHCC is that it can transfer the user suspicious probability to their comments, which is the intermediate result MHCC algorithm.…”
Section: The Spread Of Suspicious Probabilitymentioning
confidence: 99%
“… CPU (Collective PU-Learning): This method is an improvement of PU-LEA NB [6]. The difference between these two methods is that CPU regards the IP information and user's own information as the classification features which is still the static characteristics.…”
Section: Comparative Experimentsmentioning
confidence: 99%
See 1 more Smart Citation
“…[1] Then again, a lot of writing has been distributed on the systems used to recognize spam and spammers and in addition distinctive sort of investigation on this theme. These systems can be classi fied into various classifications; some utilizing phonetic examples in content, which are for the most part in view of bigram, and unigram [2], others depend on behavioral examples that depend on highlights removed from designs in clients' conduct which are generally metadata based, and even a few procedures utilizing diagrams and chart based calculations and classifiers [3]. The way that anyone with a character winds up endorsed to comment a review, makes an alluring open entryway for spammers to make fake studies arranged particularly to mislead customer's inclination.…”
Section: Introductionmentioning
confidence: 99%
“…Typically, no user profile is available but additional information for products and for reviews are available depending on site. The problem of opinion fraud detection has been addressed by focusing on the review of text information [10,31], and by behavioral approaches that utilize the behavior of fake users [18,19,24]. The problem has often been approached as a supervised classification problem with two classes, fraud and not-fraud.…”
mentioning
confidence: 99%