Plug‐in machine learning for partially linear mixed‐effects models with repeated measurements

Emmenegger, Corinne; Bühlmann, Peter

doi:10.1111/sjos.12639

Scandinavian J Statistics

2023

DOI: 10.1111/sjos.12639

|View full text |Cite

Plug‐in machine learning for partially linear mixed‐effects models with repeated measurements

Corinne Emmenegger

Peter Bühlmann

Abstract: Traditionally, spline or kernel approaches in combination with parametric estimation are used to infer the linear coefficient (fixed effects) in a partially linear mixed‐effects model for repeated measurements. Using machine learning algorithms allows us to incorporate complex interaction structures, nonsmooth terms, and high‐dimensional variables. The linear variables and the response are adjusted nonparametrically for the nonlinear variables, and these adjusted variables satisfy a linear mixed‐effects model … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

2025

Publication Types

Select...

Article2

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 57 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Double machine learning for partially linear mediation models with high-dimensional confounders

Yang,

Shao,

Liu

et al. 2025

Neurocomputing

View full text Add to dashboard Cite

Double machine learning for partially linear mediation models with high-dimensional confounders

Yang,

Shao,

Liu

et al. 2025

Neurocomputing

View full text Add to dashboard Cite

Sandwich boosting for accurate estimation in partially linear models for grouped data

Young,

Shah

2024

Journal of the Royal Statistical Society Series B: Statistical Methodology

View full text Add to dashboard Cite

We study partially linear models in settings where observations are arranged in independent groups but may exhibit within-group dependence. Existing approaches estimate linear model parameters through weighted least squares, with optimal weights (given by the inverse covariance of the response, conditional on the covariates) typically estimated by maximizing a (restricted) likelihood from random effects modelling or by using generalized estimating equations. We introduce a new ‘sandwich loss’ whose population minimizer coincides with the weights of these approaches when the parametric forms for the conditional covariance are well-specified, but can yield arbitrarily large improvements in linear parameter estimation accuracy when they are not. Under relatively mild conditions, our estimated coefficients are asymptotically Gaussian and enjoy minimal variance among estimators with weights restricted to a given class of functions, when user-chosen regression methods are used to estimate nuisance functions. We further expand the class of functional forms for the weights that may be fitted beyond parametric models by leveraging the flexibility of modern machine learning methods within a new gradient boosting scheme for minimizing the sandwich loss. We demonstrate the effectiveness of both the sandwich loss and what we call ‘sandwich boosting’ in a variety of settings with simulated and real-world data.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Plug‐in machine learning for partially linear mixed‐effects models with repeated measurements

Cited by 2 publications

References 57 publications

Double machine learning for partially linear mediation models with high-dimensional confounders

Double machine learning for partially linear mediation models with high-dimensional confounders

Sandwich boosting for accurate estimation in partially linear models for grouped data

Contact Info

Product

Resources

About