2014 IEEE International Conference on Progress in Informatics and Computing 2014
DOI: 10.1109/pic.2014.6972296
|View full text |Cite
|
Sign up to set email alerts
|

Web spam detection based on improved tri-training

Abstract: Web spamming is the deliberate manipulation of search engine indexes to make a page get high ranking than which it deserved considering its true value. Since the evolution of web spam, a new based on machine learning algorithm web spam detection method which has self-learning ability has emerged. Web spam detection is viewed as a binary classification learning problem. Because labeled training examples are fairly expensive to obtain which need the participation of experts in this field and labor costs, how to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 11 publications
0
2
0
Order By: Relevance
“…Through EEMD process, the original vibration signals collected from the two cases are transformed into two feature sets. According to [18], two parameters of EEMD, that is, the ratio of the standard deviation of the added noise and that of input, are set to be 0.15 and ensemble number is set to be 100. Information of the feature sets is tabulated in 340 examples are then put into and other 292 examples are put into without their labels.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Through EEMD process, the original vibration signals collected from the two cases are transformed into two feature sets. According to [18], two parameters of EEMD, that is, the ratio of the standard deviation of the added noise and that of input, are set to be 0.15 and ensemble number is set to be 100. Information of the feature sets is tabulated in 340 examples are then put into and other 292 examples are put into without their labels.…”
Section: Resultsmentioning
confidence: 99%
“…In addition, it possesses the merits of good efficiency and generalization ability. Tritraining has been successfully applied in Chinese chunking [16], biomedical named entity recognition [17], and web spam detection [18]. With all these advantages and successful application in other areas, tritraining is supposed to be a promising method in bearing fault diagnosis too.…”
Section: Introductionmentioning
confidence: 99%