2018
DOI: 10.1111/biom.12888
|View full text |Cite
|
Sign up to set email alerts
|

Order Selection and Sparsity in Latent Variable Models via the Ordered Factor LASSO

Abstract: Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
21
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
2

Relationship

6
3

Authors

Journals

citations
Cited by 23 publications
(21 citation statements)
references
References 25 publications
0
21
0
Order By: Relevance
“…Finally, in our simulations we assumed that the true number of latent variables is known, when in practice this also has been chosen (e.g., Hui et al, 2018). How misspecifying the latent variable correlation structures affects the number of latent variables chosen is an avenue of future research to pursue, along with the more general topic of how misspecification of the latent variable correlation structure affects variable selection as well as other aspects of inference for GLLVMs as a whole (see Hoeting et al, 2006;Xu et al, 2015, for Simulation results for empirical bias (left column), root mean squared error (middle column), and coverage probability (right column) of regression coefficients β j , for normal response GLLVMs.…”
Section: Discussionmentioning
confidence: 99%
“…Finally, in our simulations we assumed that the true number of latent variables is known, when in practice this also has been chosen (e.g., Hui et al, 2018). How misspecifying the latent variable correlation structures affects the number of latent variables chosen is an avenue of future research to pursue, along with the more general topic of how misspecification of the latent variable correlation structure affects variable selection as well as other aspects of inference for GLLVMs as a whole (see Hoeting et al, 2006;Xu et al, 2015, for Simulation results for empirical bias (left column), root mean squared error (middle column), and coverage probability (right column) of regression coefficients β j , for normal response GLLVMs.…”
Section: Discussionmentioning
confidence: 99%
“…To date we have used the software to fit a dataset of size 174 × 985 in 61 minutes. In future work, we plan to generalize GLLVMs, as well as the package, so that it can handle spatial and or temporal correlation inherent in the data, as well as offer some data-driven forms of order and variable selection (see for example [43]).…”
Section: Discussionmentioning
confidence: 99%
“…Determining if species exhibit fully quadratic curves in response to ecological gradients, whether tolerances are common for all species per ecological gradient, or if the equal tolerances assumption is suited for a dataset, comes down to a problem of model selection for the quadratic GLLVM. To that end, future research can further investigate approaches such as regularization (e.g., possibly extending the approach of Hui et al 2018), hypothesis testing, or the use of confidence intervals of the quadratic coefficients. Similar to DCA, the quadratic GLLVM provides estimates of gradient length.…”
Section: Discussionmentioning
confidence: 99%