2013
DOI: 10.1007/978-3-642-36973-5_18
|View full text |Cite
|
Sign up to set email alerts
|

Open-Set Classification for Automated Genre Identification

Abstract: Abstract. Automated Genre Identification (AGI) of web pages is a problem of increasing importance since web genre (e.g. blog, news, eshops, etc.) information can enhance modern Information Retrieval (IR) systems. The state-of-the-art in this field considers AGI as a closed-set classification problem where a variety of web page representation and machine learning models have intensively studied. In this paper, we study AGI as an open-set classification problem which better formulates the real world conditions o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
20
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 22 publications
(22 citation statements)
references
References 11 publications
1
20
0
Order By: Relevance
“…To evaluate the performance of MCC, in addition to using the common majority voting (MV) [22] algorithm as the baseline method for comparison, we also implemented the Random Feature Subspacing Ensemble (RFSE) algorithm based on the latest work [31] without considering the case of an unknown genre to suit our purpose. We used the same features in all three methods.…”
Section: Vs Other Ensemble Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…To evaluate the performance of MCC, in addition to using the common majority voting (MV) [22] algorithm as the baseline method for comparison, we also implemented the Random Feature Subspacing Ensemble (RFSE) algorithm based on the latest work [31] without considering the case of an unknown genre to suit our purpose. We used the same features in all three methods.…”
Section: Vs Other Ensemble Methodsmentioning
confidence: 99%
“…Pritsos and Stamatatos [31] focused on the use of content information with di↵erent text representation methods. In addition, they examined base SVM learners and an ensemble of classifiers.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…One of the solutions to this problem is to classify web pages according to the genre of web pages [2]. Automating the identification of the genre of web pages has become an important area in the classification of web pages [3], as it can be used to improve the quality of web search results and to reduce the search time [4].…”
Section: Term Weighting Based On Index Of Genre For Web Page Genre CLmentioning
confidence: 99%