2008
DOI: 10.1016/j.ipm.2007.11.002
|View full text |Cite
|
Sign up to set email alerts
|

Interactive high-quality text classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…The identification is based on the χ 2 (chi‐square) statistics, which is popular in TC (Himmel, Reincke, & Michelmann, 2009; Liu, 2008; Yang & Pedersen, 1997). For a term t and a category c , χ 2 ( t,c )=[N×(A×D−B×C) 2 ]/[(A+B)×(A+C)×(B+D)×(C+D)], where N is the total number of training documents, A is the number of training documents that are in c and contain t , B is the number of training documents that are not in c but contain t , C is the number of training documents that are in c but do not contain t , and D is the number of training documents that are not in c and do not contain t (Liu, 2008; Yang & Pedersen, 1997). The term‐category correlation falls into two types: positively correlated type and negatively correlated type .…”
Section: Ctfamentioning
confidence: 99%
See 1 more Smart Citation
“…The identification is based on the χ 2 (chi‐square) statistics, which is popular in TC (Himmel, Reincke, & Michelmann, 2009; Liu, 2008; Yang & Pedersen, 1997). For a term t and a category c , χ 2 ( t,c )=[N×(A×D−B×C) 2 ]/[(A+B)×(A+C)×(B+D)×(C+D)], where N is the total number of training documents, A is the number of training documents that are in c and contain t , B is the number of training documents that are not in c but contain t , C is the number of training documents that are in c but do not contain t , and D is the number of training documents that are not in c and do not contain t (Liu, 2008; Yang & Pedersen, 1997). The term‐category correlation falls into two types: positively correlated type and negatively correlated type .…”
Section: Ctfamentioning
confidence: 99%
“…To make the acceptance and rejection decisions, the classifier needs to estimate the degree of acceptance (DOA) of each document d with respect to c (e.g., similarity between d and c , or probability of d belonging to c ). However, perfect DOA estimation cannot be expected (Liu, 2008; Zhang & Callan, 2001; Arampatzis, Beney, Koster, & Weide, 2000), mainly due to the difficulties in identifying and properly encoding all helpful TC evidence with limited computational resources (e.g., memory and training documents).…”
Section: Introductionmentioning
confidence: 99%
“…The test results are assessed by recall, precision and F-index [11]. Specific calculation formulas are shown as:…”
Section: Test and Analysismentioning
confidence: 99%
“…There are other text classification techniques that have explored in other applications such as in [19][20] [21].…”
Section: Related Workmentioning
confidence: 99%