2018
DOI: 10.1109/tkde.2018.2820051
|View full text |Cite
|
Sign up to set email alerts
|

A Thorough Evaluation of Distance-Based Meta-Features for Automated Text Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 28 publications
(17 citation statements)
references
References 20 publications
0
17
0
Order By: Relevance
“…Figure 1 shows the MetaLazy steps to predict a test instance t. For every test instance (1) MetaLazy selects some features (2) and creates new co-occurrence features (3). After that, MetaLazy selects the k most similar instances to t in the training set (4). In (5) the distances are used to create weights for each neighbor of t. And finally, in (6), MetaLazy fits a weaker classifier to predicts t.…”
Section: Weaker Classifier Selectionmentioning
confidence: 99%
See 3 more Smart Citations
“…Figure 1 shows the MetaLazy steps to predict a test instance t. For every test instance (1) MetaLazy selects some features (2) and creates new co-occurrence features (3). After that, MetaLazy selects the k most similar instances to t in the training set (4). In (5) the distances are used to create weights for each neighbor of t. And finally, in (6), MetaLazy fits a weaker classifier to predicts t.…”
Section: Weaker Classifier Selectionmentioning
confidence: 99%
“…For topic categorization, we have the following benchmark datasets: (1) 20 Newsgroups (20ng); (2) 4 Universities (4UNI), a.k.a. WebKB; (3) ACM Digital Library (ACM); (4) and Reuters90. For the sentiment analysis task, we have Yelp Reviews, which consists of a set of business and services reviews and Twitter Stanford, an automatically created dataset based on a set of known positive and negative emoticons.…”
Section: Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…1,2 Recently, many works have been proposed for the TC. Canuto et al 3 presented a use of a multiobjective optimization strategy to reduce the number of metafeatures while maximizing the classification effectiveness. Do and Poulet 4 proposed a novel fast and accurate parallel local support vector machine (SVM) algorithm for classifying very high-dimensional input spaces and large-scale multiclass data sets.…”
Section: Introductionmentioning
confidence: 99%