22nd International Conference on Advanced Information Networking and Applications - Workshops (Aina Workshops 2008) 2008
DOI: 10.1109/waina.2008.120
|View full text |Cite
|
Sign up to set email alerts
|

Improving Thai Academic Web Page Classification Using Inverse Class Frequency and Web Link Information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 11 publications
0
4
0
Order By: Relevance
“…Because the two factors in tf.idf model are both estimated on the document level, but the tf factor of tf.icf is estimated on the document level and the icf factor is estimated on the category level. Our tf.icf is different from previous tf.icf methods [25][26][27][28][29][30][31], which have the same name, but have different meanings. For example, Reed [25] use the abbreviation ICF (Inverse Corpus Frequency) in dealing with stream data, and ICF in Ref [25] is inverse document frequency of the whole corpus.…”
Section: Two Novel Term Weighting Schemes Based On Icfmentioning
confidence: 91%
See 2 more Smart Citations
“…Because the two factors in tf.idf model are both estimated on the document level, but the tf factor of tf.icf is estimated on the document level and the icf factor is estimated on the category level. Our tf.icf is different from previous tf.icf methods [25][26][27][28][29][30][31], which have the same name, but have different meanings. For example, Reed [25] use the abbreviation ICF (Inverse Corpus Frequency) in dealing with stream data, and ICF in Ref [25] is inverse document frequency of the whole corpus.…”
Section: Two Novel Term Weighting Schemes Based On Icfmentioning
confidence: 91%
“…Pei et al [16] proposed an improved tf.idf method, which combined tf.idf and information gain. Many variations of ICF have been used in document classification [26,27,30,31].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The results suggested that the bigram model of TFIDF with term distributions was a good model. [44] improved Thai-language academic web page classification by using inverse class frequency and web link information. They suggest that inverse class frequency should be used instead of inverse document frequency for centroid based text categorization.…”
Section: Literature Reviewmentioning
confidence: 99%