2022
DOI: 10.48550/arxiv.2203.00081
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Asymptotic Normality of Gini Correlation in High Dimension with Applications to the K-sample Problem

Abstract: The categorical Gini correlation proposed by Dang et al. [6] is a dependence measure between a categorical and a numerical variables, which can characterize independence of the two variables. The asymptotic distributions of the sample correlation under the dependence and independence have been established when the dimension of the numerical variable is fixed. However, its asymptotic distribution for high dimensional data has not been explored. In this paper, we develop the central limit theorem for the Gini c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 31 publications
(67 reference statements)
0
1
0
Order By: Relevance
“…However, for univariate X (l) with dimension q l = 1, a simple fast algorithm for (4) only costs O(n log n). This makes the marginal feature screening appealing even for large n. Dang et al [5] and Sang and Dang [26] have explored the properties of the GDC in low-dimensional and highdimensional settings, respectively. Compared with the popular distance correlation [30,19], the GDC is more straightforward to perform statistical inference and it is more robust to deal with unbalanced data.…”
Section: Gini Distance Correlationmentioning
confidence: 99%
“…However, for univariate X (l) with dimension q l = 1, a simple fast algorithm for (4) only costs O(n log n). This makes the marginal feature screening appealing even for large n. Dang et al [5] and Sang and Dang [26] have explored the properties of the GDC in low-dimensional and highdimensional settings, respectively. Compared with the popular distance correlation [30,19], the GDC is more straightforward to perform statistical inference and it is more robust to deal with unbalanced data.…”
Section: Gini Distance Correlationmentioning
confidence: 99%