2019
DOI: 10.7554/elife.41690
|View full text |Cite
|
Sign up to set email alerts
|

Discovering and deciphering relationships across disparate data modalities

Abstract: Understanding the relationships between different properties of data, such as whether a genome or connectome has information about disease status, is increasingly important. While existing approaches can test whether two properties are related, they may require unfeasibly large sample sizes and often are not interpretable. Our approach, ‘Multiscale Graph Correlation’ (MGC), is a dependence test that juxtaposes disparate data science techniques, including k-nearest neighbors, kernel methods, and multiscale anal… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
29
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 24 publications
(30 citation statements)
references
References 83 publications
(204 reference statements)
1
29
0
Order By: Relevance
“…(delta, r = 0.052, p = 0.19; theta, r = 0.078, p = 0.58; alpha, r = 0.103, p = 0.45; beta, r = 0.102, p = 0.89, broadband, r = 0.12, p = 0.75). We confirm these findings using three other measures of similarity, image intraclass correlation coefficient (I2C2) ( Shou et al, 2013 ), Multiscale Graph Correlation (MGC) algorithm ( Shen et al, 2018 ; Vogelsteinetal., 2019 ) and geodesic distance ( Venkatesh etal., 2020 ) (see Supplement ). Given these results and the distinct clusters of connectivity shown in Fig.…”
Section: Resultssupporting
confidence: 71%
See 1 more Smart Citation
“…(delta, r = 0.052, p = 0.19; theta, r = 0.078, p = 0.58; alpha, r = 0.103, p = 0.45; beta, r = 0.102, p = 0.89, broadband, r = 0.12, p = 0.75). We confirm these findings using three other measures of similarity, image intraclass correlation coefficient (I2C2) ( Shou et al, 2013 ), Multiscale Graph Correlation (MGC) algorithm ( Shen et al, 2018 ; Vogelsteinetal., 2019 ) and geodesic distance ( Venkatesh etal., 2020 ) (see Supplement ). Given these results and the distinct clusters of connectivity shown in Fig.…”
Section: Resultssupporting
confidence: 71%
“…The absolute median difference is driven by absolute differences in strength of connectivity at each link, while MDMR compares the similarity of patterns of connectivity within and between networks. Alternative analysis methods such as Joint and Individual Variation Explained (JIVE) ( Lock et al, 2013 ; Yu et al, 2017 ) or Multiscale Graph Correlation ( Shen et al, 2018 ; Vogelstein et al, 2019 ) give us similar results for MDMR ( Fig. A7 ).…”
Section: Discussionmentioning
confidence: 79%
“…However, the nature of such relationships at a specific resolution with phenotypic characteristics should be predefined as our framework only reveals the presence/absence of linear relationships. A recently introduced approach in (Vogelstein et al, ) not only allows identifying the existence of relationships with different natures, but also provides an adaptive framework to identify the most informative components (subsystems) that contribute to such relationships. However, unlike our framework, it may not be able to disentangle the relationships with phenotypic characteristics when (multiple overlapping) subsystems with complex interplays at different resolutions are included.…”
Section: Discussionmentioning
confidence: 99%
“…First, rather than operating on all pairwise distances, Discr only considers the subset that are between measurements of the same item, all other distances are literally discarded, as they do not provide information about the question of interest here. Second, Discr effectively operates on the ranks of distances, rather than the distances, rendering it robust to monotonic transformations of the data [28]. Figure 1 shows three different simulations illustrating the differences between Discr and other reliability statistics, including the fingerprinting index (Fingerprint) [29], intraclass correlation coefficient (ICC) [30], and Kernel [26] (see Appendix A for details).…”
Section: The Discriminability Statisticmentioning
confidence: 99%
“…Indeed, Figure 5 shows that, for virtually every single dataset including sex and age annotation (22 in total), a pipeline with higher Discr tends to preserve more information about both covariates. The amount of information is quantified by the effect size of the distance correlation DCorr (which is exactly equivalent to Kernel [28,39]), a statistic that quantifies the magnitude of association for both linear and nonlinear dependence structures. In contrast, if one were to use either PICC, Kernel, or I2C2…”
Section: Optimizing Discriminability Improves Downstream Inference Pementioning
confidence: 99%