2011
DOI: 10.1016/j.patrec.2010.09.023
|View full text |Cite
|
Sign up to set email alerts
|

A sparse version of the ridge logistic regression for large-scale text categorization

Abstract: International audienceThe ridge logistic regression has successfully been used in text categorization problems and it has been shown to reach the same performance as the Support Vector Machine but with the main advantage of computing a probability value rather than a score. However, the dense solution of the ridge makes its use unpractical for large scale categorization. On the other side, LASSO regularization is able to produce sparse solutions but its performance is dominated by the ridge when the number of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
20
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(20 citation statements)
references
References 17 publications
0
20
0
Order By: Relevance
“…It has been proven to outperform traditional back-off smoothing, because the former has the ability to process unknown terms and also avoids over evaluating the conditional probability which is originally zero. In future, these kinds of works could be extended to evaluate relationship between sentences rather than words (Aseervatham, Antoniadis, Gaussier, Burlet, & Denneulin, 2011) The automatic text categorization is the process of assigning, one or more textual documents to predefined categories based on its contents. However, it encounters a problem when the number of features exceeds the number of observations.…”
Section: Logistic Regressionmentioning
confidence: 99%
See 1 more Smart Citation
“…It has been proven to outperform traditional back-off smoothing, because the former has the ability to process unknown terms and also avoids over evaluating the conditional probability which is originally zero. In future, these kinds of works could be extended to evaluate relationship between sentences rather than words (Aseervatham, Antoniadis, Gaussier, Burlet, & Denneulin, 2011) The automatic text categorization is the process of assigning, one or more textual documents to predefined categories based on its contents. However, it encounters a problem when the number of features exceeds the number of observations.…”
Section: Logistic Regressionmentioning
confidence: 99%
“…However, it encounters a problem when the number of features exceeds the number of observations. Also, ML techniques tend to perform weakly due to these overfitting problems; in which case, the model memorizes the training set instead of acquiring knowledge from them (Aseervatham et al, 2011). To prevent this, the complexity of the model has to be controlled during the training process using model selection techniques.…”
Section: Logistic Regressionmentioning
confidence: 99%
“…Since feature selection can be regarded as a binary regression task about each dimension of the original feature, a logistic regression function is used to denote a conditional probability model with the form defined byPfalse(w|l,f(x),bfalse)=)(1+exp)(l)(wTf(x)+b1where f(x) represents the original feature signature of voxel x, and w is the binary coefficient with 1 indicating that the corresponding features are relevant to the anatomical classification, and 0 denoting that the nonrelevant features are eliminated during the classifier learning process. l (·) is an anatomical binary labeling function.…”
Section: Methodsmentioning
confidence: 99%
“…Logistic Regression (LR) is a well-known statistical algorithm which was used widely in information retrieval [17][18][19][20][21][22]. LR was also investigated algorithm in English TC by some researchers [23][24][25][26][27][28][29][30][31][32][33][34][35][36].…”
Section: Introductionmentioning
confidence: 99%