2021
DOI: 10.1111/biom.13553
|View full text |Cite
|
Sign up to set email alerts
|

Simultaneous feature selection and outlier detection with optimality guarantees

Abstract: Biomedical research is increasingly data rich, with studies comprising ever growing numbers of features. The larger a study, the higher the likelihood that a substantial portion of the features may be redundant and/or contain contamination (outlying values). This poses serious challenges, which are exacerbated in cases where the sample sizes are relatively small. Effective and efficient approaches to perform sparse estimation in the presence of outliers are critical for these studies, and have received conside… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
17
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(20 citation statements)
references
References 60 publications
2
17
0
Order By: Relevance
“…Nevertheless, nowadays it can be solved effectively and at times also efficiently with specialized solvers. Importantly, it relates to the use of a trimmed loss function as in [10], and it extends the work in [7] for sparse linear regression models affected by data contamination in the form of mean-shift outliers. However, here the use of a nonlinear and nonquadratic objective function complicates the matter and requires special attention.…”
Section: Miprob: Robust Variable Selection Under the Logistic Slippage Modelmentioning
confidence: 90%
See 4 more Smart Citations
“…Nevertheless, nowadays it can be solved effectively and at times also efficiently with specialized solvers. Importantly, it relates to the use of a trimmed loss function as in [10], and it extends the work in [7] for sparse linear regression models affected by data contamination in the form of mean-shift outliers. However, here the use of a nonlinear and nonquadratic objective function complicates the matter and requires special attention.…”
Section: Miprob: Robust Variable Selection Under the Logistic Slippage Modelmentioning
confidence: 90%
“…For instance, an ensemble method based on existing heuristic and robust procedures to create suitable big-M bounds was considered in [7]. However, a similar approach is challenging in this framework given a "pool" of openly available robust algorithms is not available for logistic regression models-unlike in linear regression.…”
Section: Algorithmic Implementationmentioning
confidence: 99%
See 3 more Smart Citations