Sigir ’94 1994
DOI: 10.1007/978-1-4471-2099-5_3
|View full text |Cite
|
Sign up to set email alerts
|

Towards Language Independent Automated Learning of Text Categorization Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
60
0
4

Year Published

1998
1998
2006
2006

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 87 publications
(64 citation statements)
references
References 4 publications
0
60
0
4
Order By: Relevance
“…Most methods for feature subset selection that are used information retrieval and text-learning (eg. [1], [3], [11]) are very simple compared to the methods developed in machine learning. Basically, some scoring measure that is used on a single feature is selected, a score is assigned to each feature independently, features are sorted according to the assigned score and a predefined number of the best features is taken to form the solution feature subset.…”
Section: F E a T U R E S U B S E T S E L E C T I O N Approachesmentioning
confidence: 99%
“…Most methods for feature subset selection that are used information retrieval and text-learning (eg. [1], [3], [11]) are very simple compared to the methods developed in machine learning. Basically, some scoring measure that is used on a single feature is selected, a score is assigned to each feature independently, features are sorted according to the assigned score and a predefined number of the best features is taken to form the solution feature subset.…”
Section: F E a T U R E S U B S E T S E L E C T I O N Approachesmentioning
confidence: 99%
“…Naïve Bayes, assume the features as independent components. Hence, rule learning based classifiers are context sensitive classifiers (Apte et al 1994;Cohen and Singer 1996).…”
Section: Rule Learning Based Classifiersmentioning
confidence: 99%
“…In this experiment we used the partition of Reuters-21450 that was prepared by Apté, Damerau, and Weiss (1994) for their experiments with the SWAP-1 rule learner. The Reuters-21450 corpus was partitioned into a training set of 7,789 documents and a test set containing 3,309 document.…”
Section: An Experiments With a Large Number Of Classesmentioning
confidence: 99%
“…(See, for instance, Apté, Damerau, & Weiss (1994), Biebricher et al (1988), Cohen & Singer (1996), Field (1975), Fuhr & Pfeifer (1994), Koller & Sahami (1997), Lewis & Ringuette (1994), Moulinier, Raškinis, & Ganascia (1996), Ng, Goh, & Low (1997) and Yang (1994)). It would be impossible for us to compare our algorithms to all of the previous methods.…”
Section: Introductionmentioning
confidence: 99%