2018
DOI: 10.1002/gepi.22112
|View full text |Cite
|
Sign up to set email alerts
|

An analytic approach for interpretable predictive models in high‐dimensional data in the presence of interactions with exposures

Abstract: Predicting a phenotype and understanding which variables improve that prediction are two very challenging and overlapping problems in the analysis of high‐dimensional (HD) data such as those arising from genomic and brain imaging studies. It is often believed that the number of truly important predictors is small relative to the total number of variables, making computational approaches to variable selection and dimension reduction extremely important. To reduce dimensionality, commonly used two‐step methods f… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 52 publications
0
4
0
Order By: Relevance
“…gene command from the ape package (vers 5.0; [100]) in R [101]. This genetic distance matrix was reduced to 2 dimensions by multidimensional scaling using the cmdscale and eclust commands [102]. These two dimensions were then used to hierarchically cluster the populations into 2 groups using kmeans clustering.…”
Section: Outlier Analysis Of Targeted Exome Re-sequencingmentioning
confidence: 99%
“…gene command from the ape package (vers 5.0; [100]) in R [101]. This genetic distance matrix was reduced to 2 dimensions by multidimensional scaling using the cmdscale and eclust commands [102]. These two dimensions were then used to hierarchically cluster the populations into 2 groups using kmeans clustering.…”
Section: Outlier Analysis Of Targeted Exome Re-sequencingmentioning
confidence: 99%
“…In the subsequent phase, we employ K-means clustering using ECLUST, 21 considering the cluster counts and centroids established in the initial phase. The resulting cluster are shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…The input values for each putative interactor corresponded to the average (across all biological replicates) log 2 fold changes in the abundance of the putative interactor in cells expressing a bait enzyme relative to the parental HeLa cells. Hierarchical clustering was performed on the centered and scaled input values with the “eclust” package ( 65 ) using the “ward.D2” linkage and the “Euclidean” distance metric. For visualization, the “ComplexHeatmap” package ( 66 ) was used in R.…”
Section: Methodsmentioning
confidence: 99%