2019
DOI: 10.12688/f1000research.16216.2
|View full text |Cite
|
Sign up to set email alerts
|

Feature selection with the R package MXM

Abstract: Feature (or variable) selection is the process of identifying the minimal set of features with the highest predictive performance on the target variable of interest. Numerous feature selection algorithms have been developed over the years, but only few have been implemented in R and made publicly available R as packages while offering few options. The R package MXM offers a variety of feature selection algorithms, and has unique features that make it advantageous over its competitors: a) it contains feature se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 26 publications
0
10
0
Order By: Relevance
“…• Max-Min Parents and Children (MMPC): Incorporated in the MXM R-package [15], this algorithm conducts a constraint-based feature selection, assuming a Bayesian network for input variables. The permutation option was activated (R = 999) and the max_k value was specified as: sample size/10 for optimizing the performance.…”
Section: Dimension Reductionmentioning
confidence: 99%
See 1 more Smart Citation
“…• Max-Min Parents and Children (MMPC): Incorporated in the MXM R-package [15], this algorithm conducts a constraint-based feature selection, assuming a Bayesian network for input variables. The permutation option was activated (R = 999) and the max_k value was specified as: sample size/10 for optimizing the performance.…”
Section: Dimension Reductionmentioning
confidence: 99%
“…Being counterfactual to the common phenomenon of large-n, smallp contexts, classical statistical models are challenged by this unique structure. Various techniques proposed to address the curse of dimensionality in omics data, are dominated by feature selection workflows [12] and regularization techniques [13], while de-novo algorithms tailored to omics data have also been developed [14,15]. Although a single best method to deal with high-dimensionality does not exist, proposed approaches have unequivocally contributed to increasing the robustness of omics data analytics and biomarker discovery.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, in current biomarker research simple solutions such as NTP classifier became popular [ 125 , 126 ]. Moreover, sets of selected markers offer much better performance than single marker, therefore, currently in various studies predictive power have benefited from such multi-marker approach [ 127 ]. For example, in the study of Xing et al, three serum markers were evaluated as potentially clinically usable: CHI3L1, MMP13, and SPP1, from more than 4000 differentially expressed genes in the ESCC transcriptome database.…”
Section: Applications Of Transcriptome Analysis In Oncologymentioning
confidence: 99%
“…The predicted risk score for the patients was calculated based on the expression status of the prognostic genes, implemented in R with the predict.coxph function. Particularly, the features (prognostic genes) were selected by the Maximum Minimum Parents and Children (MMPC) algorithm (Lagani et al, 2016) and implemented by the MXM package in R.…”
Section: Cox Proportional Hazards Regression Analysismentioning
confidence: 99%