2012
DOI: 10.18637/jss.v051.i10
|View full text |Cite
|
Sign up to set email alerts
|

Meta-Statistics for Variable Selection: TheRPackageBioMark

Abstract: Biomarker identification is an ever more important topic in the life sciences. With the advent of measurement methodologies based on microarrays and mass spectrometry, thousands of variables are routinely being measured on complex biological samples. Often, the question is what makes two groups of samples different. Classical hypothesis testing suffers from the multiple testing problem; however, correcting for this often leads to a lack of power. In addition, choosing α cutoff levels remains somewhat arbitrary… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(17 citation statements)
references
References 21 publications
0
17
0
Order By: Relevance
“…For the purpose, the aov function of the R package “stats” ( 7 ) was applied on the parameters expressed as ranks ( 10 ). To circumvent in part the dichotomous application of P values, as suggested by Greenland et al ( 27 ) and Wehrens and Franceschi ( 57 ), a P value trim limit of 0.1 was considered, in agreement with Kang et al ( 32 ).…”
Section: Methodsmentioning
confidence: 99%
“…For the purpose, the aov function of the R package “stats” ( 7 ) was applied on the parameters expressed as ranks ( 10 ). To circumvent in part the dichotomous application of P values, as suggested by Greenland et al ( 27 ) and Wehrens and Franceschi ( 57 ), a P value trim limit of 0.1 was considered, in agreement with Kang et al ( 32 ).…”
Section: Methodsmentioning
confidence: 99%
“…Finally, the findings from this study provide new results for variable selection methods, especially for the stability-based variable selection approach, which has not previously been extensively evaluated for LASSO and Elastic Net [5, 12]. The stability-based student t test and VIP scores outperformed Elastic Net and LASSO in all parameter configurations but not when the effect size and the number of variables were large.…”
Section: Discussionmentioning
confidence: 74%
“…The R package Biomark [5, 6] includes these popular variable selection methods: student t test, Variable Importance in Projection (VIP) scores [7, 8] from Partial Least Squares Regression (PLS-DA) models, Least Absolute Shrinkage and Selection Operator (LASSO) [9], and Elastic Net [10, 11]. Each method has different strengths and weaknesses for identifying significant variables often found in biological data like in metabolomics, and possibly, for modelling them.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The R package spikeSlabGAM implements Bayesian variable selection via regularized estimation in additive mixed models (Scheipl, 2011). Dedicated to the biomarker identification in the life sciences, the R package BioMark implements two meta-statistics for variable selection (Wehrens et al, 2012): the first sets a data-dependent selection threshold for significance, which is useful when two groups are compared, and the second, more general one, uses repeated subsampling and selects the model coefficients remaining consistently important.…”
Section: Introductionmentioning
confidence: 99%