2012
DOI: 10.5705/ss.2010.216
|View full text |Cite
|
Sign up to set email alerts
|

Extended BIC for small-n-large-P sparse GLM

Abstract: SummaryThe small-n-large-P situation has become common in genetics research, medical studies, risk management, and other fields. Feature selection is crucial in these studies yet poses a serious challenge. The traditional criteria such as AIC, BIC, and crossvalidation choose too many features. To overcome the difficulties caused by the small-n-large-P situation, Chen and Chen (2008) developed a family of extended Bayes information criteria (EBIC). Under normal linear models, EBIC is found to be consistent with… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
139
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 156 publications
(141 citation statements)
references
References 34 publications
2
139
0
Order By: Relevance
“…The selection consistency of EBIC for GLIMs with canonical links does not trivially pass to the case of non-canonical links. The selection consistency in the case of non-canonical links is established under more general conditions than those in Chen and Chen (2012). The conditions, though general, are naturally satisfied by many popular examples as given in Wedderburn (1976).…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations
“…The selection consistency of EBIC for GLIMs with canonical links does not trivially pass to the case of non-canonical links. The selection consistency in the case of non-canonical links is established under more general conditions than those in Chen and Chen (2012). The conditions, though general, are naturally satisfied by many popular examples as given in Wedderburn (1976).…”
Section: Introductionmentioning
confidence: 99%
“…Conditions C2 and C3 are the same as conditions A2 and A3 in Chen and Chen (2012). Conditions C4-C5 reduce to conditions A4-A5 in Chen and Chen (2012) for canonical links.…”
Section: Selection Consistency Of Ebicmentioning
confidence: 99%
See 2 more Smart Citations
“…This results in datasets with small number of observations (small n) but a very high number of variables (large p). Since most of the statistical methods need sufficiently large number of observations to provide reliable estimates, such "long" data matrices lead to problematic computations [2]. Both the high dimensionality of the datasets and the "p ≫ n problem", pose big challenges for the analyst and the computational tools.…”
Section: Introductionmentioning
confidence: 99%