2008
DOI: 10.1108/10662240810862248

Classifying information sender of web documents

Abstract: Purpose: To develop a method for classifying the information sender of web documents, which constitutes an important part of information credibility analysis. Design/methodology/approach: A machine learning approach was employed. About 2,000 human-annotated web documents were prepared for training and evaluation. The classification model was based on a support vector machine, and the features used for classification included the title and URL of each document, as well as information from the top page. Findings: With rel…
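
To make the approach described in the abstract more concrete, the following is a minimal sketch of sender classification from title, URL, and top-page text with a linear SVM. It is not the authors' implementation: the feature scheme (`extract_features`), the two sender labels (+1 for a government sender, -1 for a company sender), the toy training pages, and the simple subgradient trainer standing in for a full SVM solver are all assumptions made for illustration.

```python
import re
from collections import Counter
from urllib.parse import urlparse

TOKEN = re.compile(r"[a-z0-9]+")

def extract_features(title, url, top_page_title=""):
    """Bag-of-words features, prefixed by the field they came from.

    The prefixes (title=, host=, path=, top=) are a hypothetical scheme,
    not the feature set used in the paper.
    """
    parsed = urlparse(url)
    fields = {
        "title": title,
        "host": parsed.netloc,
        "path": parsed.path,
        "top": top_page_title,
    }
    feats = Counter()
    for name, text in fields.items():
        for tok in TOKEN.findall(text.lower()):
            feats[name + "=" + tok] += 1
    return dict(feats)

def score(weights, bias, feats):
    """Linear decision value w.x + b over sparse features."""
    return bias + sum(weights.get(k, 0.0) * v for k, v in feats.items())

def train_linear_svm(samples, labels, epochs=50, lr=0.1, reg=0.01):
    """Train a linear SVM by subgradient descent on the L2-regularized hinge loss."""
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for feats, y in zip(samples, labels):
            margin = y * score(weights, bias, feats)
            # Regularization step: shrink every existing weight slightly.
            for k in weights:
                weights[k] *= 1.0 - lr * reg
            # Hinge-loss step: only samples inside the margin push the weights.
            if margin < 1:
                for k, v in feats.items():
                    weights[k] = weights.get(k, 0.0) + lr * y * v
                bias += lr * y
    return weights, bias

def predict(weights, bias, feats):
    return 1 if score(weights, bias, feats) >= 0 else -1

# Toy, made-up training pages: (title, URL, label).
# +1 = government sender, -1 = company sender (hypothetical label set).
TRAIN = [
    ("Ministry of Health official notice", "http://www.example.go.jp/notice/", 1),
    ("City office public services", "http://city.example.lg.jp/services", 1),
    ("Acme Corp product catalog", "http://www.acme.example.co.jp/products", -1),
    ("Online shop special sale", "http://shop.example.com/sale", -1),
]

X = [extract_features(title, url) for title, url, _ in TRAIN]
y = [label for _, _, label in TRAIN]
weights, bias = train_linear_svm(X, y)
predictions = [predict(weights, bias, feats) for feats in X]
```

A real system would replace the four toy pages with the annotated corpus, tokenize Japanese text with a morphological analyzer rather than the ASCII regular expression used here, and evaluate on held-out documents instead of the training set.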

Cited by 2 publications (1 citation statement)
References 7 publications

“…Page gathering and individual analysis modules based on natural language processing need to be automated and improved in the future. Currently, collecting Japanese pages (NICT, 2006) and sender classifications (Kato et al., 2007) are becoming automated. Other modules and data are scheduled to be updated accordingly.…”
Section: Discussion (mentioning)
confidence: 99%