2015 IEEE 27th International Conference on Tools With Artificial Intelligence (ICTAI) 2015
DOI: 10.1109/ictai.2015.56
|View full text |Cite
|
Sign up to set email alerts
|

A Combined Approach for Filter Feature Selection in Document Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 22 publications
0
5
0
Order By: Relevance
“…The feature filtering methods used in the study are document frequency (DF) together with term variance (TV). To exploit the advantages of two different FS methods, a hybridization of cluster-based and the frequency-based approach is presented by Nguyen and Bao [13]. The proposed method termed FCFS on comparison with its counterpart achieved the best performance in terms of micro-F1.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The feature filtering methods used in the study are document frequency (DF) together with term variance (TV). To exploit the advantages of two different FS methods, a hybridization of cluster-based and the frequency-based approach is presented by Nguyen and Bao [13]. The proposed method termed FCFS on comparison with its counterpart achieved the best performance in terms of micro-F1.…”
Section: Related Workmentioning
confidence: 99%
“…One of the most crucial steps of the preprocessing of text data is the presentation of text documents into vector space via Bag_of_Word (BOW) [13][14] [15]. The final product of this task is associated with two main issues, a vast number of features representation, and the presence of irrelevant and noisy features which general termed high dimensionality [15].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The Naive Bayes model is more efficient than that of the logistic regression scheme, nearest neighbour, decision tree and neural network, according to Receiver Operating Characteristic (ROC) curve, which is more significantly implemented in the research domain. The model is a simple, less parameterized and efficient one in terms of performance [11]. KNN is a special type of instance-based classification model, in which approximation function is computed locally and it continues until classification occurs.…”
Section: Related Workmentioning
confidence: 99%
“…Many strategies for classifying research articles have been presented in the literature. These techniques are characterized as citation, metadata, content-based, or hybrid techniques [2], [3], [4], [5], [6]. Metadata based approaches are important among them due to its nature (always free available online).…”
Section: Introductionmentioning
confidence: 99%