2022
DOI: 10.1111/biom.13672
|View full text |Cite
|
Sign up to set email alerts
|

Model-Based Clustering of High-Dimensional Longitudinal Data via Regularization

Abstract: We propose a model‐based clustering method for high‐dimensional longitudinal data via regularization in this paper. This study was motivated by the Trial of Activity in Adolescent Girls (TAAG), which aimed to examine multilevel factors related to the change of physical activity by following up a cohort of 783 girls over 10 years from adolescence to early adulthood. Our goal is to identify the intrinsic grouping of subjects with similar patterns of physical activity trajectories and the most relevant predictors… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 31 publications
0
8
0
Order By: Relevance
“…It is obvious that these other approaches fail to consider the relationship between the covariates and the outcome of interest during the clustering process as they only make such a connection after the clustering has been performed. In addition, limited work has been done on the clustering of longitudinal high-dimensional data [ 19 ]. The longitudinal latent class analysis may be used to consider the correlation inherent in time-dependent, repeated-measure observations, with the limitation that all time points must be identical across subjects.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…It is obvious that these other approaches fail to consider the relationship between the covariates and the outcome of interest during the clustering process as they only make such a connection after the clustering has been performed. In addition, limited work has been done on the clustering of longitudinal high-dimensional data [ 19 ]. The longitudinal latent class analysis may be used to consider the correlation inherent in time-dependent, repeated-measure observations, with the limitation that all time points must be identical across subjects.…”
Section: Discussionmentioning
confidence: 99%
“…In this paper, we consider the clustering of correlated observations with categorical outcomes and high-dimensional microbiome data. The method used here is a novel, non-trivial extension of the one detailed in Yang and Wu [ 19 ], which is a mixture-model-based clustering method for longitudinal data with regularization to enforce a variable selection of high-dimensional covariates. We extend this by considering categorical outcomes as opposed to Gaussian outcomes, which pose unique challenges of their own, as the Yang and Wu method only considers Gaussian outcomes.…”
Section: Discussionmentioning
confidence: 99%
“…A novel one-step procedure [ 33 ] was used to perform clustering and variable selection simultaneously via a mixture of linear mixed-effects models with shrinkage penalties on both fixed effects and random effects.…”
Section: Discussionmentioning
confidence: 99%
“…As one can see from the discussion above, post-selection inference in clustering methods is even more challenging than the inference problem for single models because one should consider not only the within-cluster but also between-cluster significance. In the paper of Yang and Wu [ 33 ], the authors proved that joint convergence rate of the fixed and random effects when both dimensions grow at an exponential rate of sample size within clusters (i.e., for a homogeneous population). Under the same setting, they also proved the sparsistency property.…”
Section: Discussionmentioning
confidence: 99%
“…A simultaneous penalised linear mixed model (SP-LMM), as implemented in the splmm R package, was fitted for each dataset. This involves a feature selection in a high-dimensional longitudinal setting [14]. A few metabolites and proteins with the largest absolute effect size were considered to be potentially associated with the CVD onset.…”
Section: Methodsmentioning
confidence: 99%