2016
DOI: 10.1371/journal.pone.0151551
|View full text |Cite
|
Sign up to set email alerts
|

Discovering General Multidimensional Associations

Abstract: When two variables are related by a known function, the coefficient of determination (denoted R2) measures the proportion of the total variance in the observations explained by that function. For linear relationships, this is equal to the square of the correlation coefficient, ρ. When the parametric form of the relationship is unknown, however, it is unclear how to estimate the proportion of explained variance equitably—assigning similar values to equally noisy relationships. Here we demonstrate how to directl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
23
0
6

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(29 citation statements)
references
References 14 publications
0
23
0
6
Order By: Relevance
“…We then applied a generalized measure of association, the generalized correlation coefficient (GCC), to investigate the association between intra-pair difference of BMI and intra-pair difference of expression to control the genetic and shared environmental effects, as proposed by Tan et al [13]. GCC was computed using a ratio of maximum likelihoods for the marginal distribution and maximum weighted likelihoods for the joint distribution using the R package matie [14]. The mRNA probes with p<0.05 were used for gene set enrichment analysis (GSEA) to detect biological pathways over-represented by the listed probes for functional interpretation [15].…”
Section: Discussionmentioning
confidence: 99%
“…We then applied a generalized measure of association, the generalized correlation coefficient (GCC), to investigate the association between intra-pair difference of BMI and intra-pair difference of expression to control the genetic and shared environmental effects, as proposed by Tan et al [13]. GCC was computed using a ratio of maximum likelihoods for the marginal distribution and maximum weighted likelihoods for the joint distribution using the R package matie [14]. The mRNA probes with p<0.05 were used for gene set enrichment analysis (GSEA) to detect biological pathways over-represented by the listed probes for functional interpretation [15].…”
Section: Discussionmentioning
confidence: 99%
“…By providing the amount of information one variable reveals about another, MI measures the dependency between two variables of any type. Based on the concept of rank correlation, Murrell et al [15] very recently proposed a generalized correlation coefficient for non-parametric measurement of association between variables. The association score, A, ranges from 0 (when the variables are independent) to 1 (when they are perfectly associated).…”
Section: Methodsmentioning
confidence: 99%
“…Under the normal assumption in least squares regression, the squared deviations can be expressed as probability density so that . The last part of the formula (the proportion of unexplained variance) is the geometric mean of the squared ratio of the probability of observing a data point under the null model over the probability of that data point under the alternative model [15]. Here, the formulation of correlation depends only on the ratio of the probability density between two models which do not necessarily require normality assumption.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations