2020
DOI: 10.1177/0962280220970228
|View full text |Cite
|
Sign up to set email alerts
|

Small sample sizes: A big data problem in high-dimensional data analysis

Abstract: In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 45 publications
(22 citation statements)
references
References 44 publications
0
21
0
1
Order By: Relevance
“…High data dimensionality in relation to sample size presents a challenge in controlling Type I error (Konietschke, Schwab, & Pauly, 2020). Thus, drawing robust inferences from a high-dimensional feature space containing 148 cortical regions (Destrieux Atlas) requires a sample size that is larger than the number of features.…”
Section: Introductionmentioning
confidence: 99%
“…High data dimensionality in relation to sample size presents a challenge in controlling Type I error (Konietschke, Schwab, & Pauly, 2020). Thus, drawing robust inferences from a high-dimensional feature space containing 148 cortical regions (Destrieux Atlas) requires a sample size that is larger than the number of features.…”
Section: Introductionmentioning
confidence: 99%
“…Due to the limited sample size of the discovery group A and the large number of proteins measured, the association of circulating proteins with DKD progression was only significant at the nominal p value threshold, with statistical significance lost after multiple testing. This is a well-known limitation of small sample sizes in high-dimensional data analysis (94), that we addressed in several ways. Critically, we confirmed the significant association of ANGPT2 with composite endpoint in an independent group of patients from C-PROBE (group B) and further confirmed the finding in the C-STRIDE cohort, using two different assay platforms.…”
Section: Discussionmentioning
confidence: 99%
“…Although multiple problems have been cited with the studies on a small sample,[ 21 22 23 ] many examples exist of useful studies on small samples. Some big discoveries have started with case series such as the dissemination of Kaposi sarcoma in young homosexuals[ 24 ] and pneumocystis pneumonia.…”
Section: Small Nmentioning
confidence: 99%