2011
DOI: 10.1016/j.eswa.2010.08.100
|View full text |Cite
|
Sign up to set email alerts
|

Using chi-square statistics to measure similarities for text categorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
43
0
1

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 95 publications
(44 citation statements)
references
References 12 publications
(18 reference statements)
0
43
0
1
Order By: Relevance
“…After every accident investigation report has been decomposed using HFACS framework, the frequency and marginal frequency of every factor can be calculated. First, the χ 2 (Chi-square) (Chen, Y., Chen, M. C. 2011) is used to test for independence, where we assume that represents the hypothesis that human factors of HFACS are independent of each other; states the human factors of HFACS are dependent and correlated to each other: N N N N N N N N N N N N N …”
Section: Overview Of Analysis Methods Rationalementioning
confidence: 99%
“…After every accident investigation report has been decomposed using HFACS framework, the frequency and marginal frequency of every factor can be calculated. First, the χ 2 (Chi-square) (Chen, Y., Chen, M. C. 2011) is used to test for independence, where we assume that represents the hypothesis that human factors of HFACS are independent of each other; states the human factors of HFACS are dependent and correlated to each other: N N N N N N N N N N N N N …”
Section: Overview Of Analysis Methods Rationalementioning
confidence: 99%
“…Filters are computationally simple and fast. Some of the popular filter approaches are mutual information [14], chisquare [15], and information gain [16].…”
Section: Dimensionality Reductionmentioning
confidence: 99%
“…After features, that well model the problem, have been defined, machine learning algorithms need to be chosen and in the field of miRNA detection supervised methods have been applied widely (see Chapters 10,12,[15][16][17][18].…”
Section: What Are Features?mentioning
confidence: 99%
“…The information gain of term "w" is: (2) ranges from 1 to 4. 2)Chi-Square Statistic Chi-square statistic ( ) quantify the importance of the feature terms by estimating the correlation between terms and classes [10]. The terms should be selected with the highest correlation.…”
Section: Select the Feature Termsmentioning
confidence: 99%