Big data are ubiquitous across the sciences, engineering, medicine, the social sciences, and the humanities, and they often come with a large number of variables and features. While an enriched feature space adds much greater flexibility to modeling, ultrahigh‐dimensional data analysis poses fundamental challenges to scalable learning and inference with good statistical efficiency. Sure independence screening is a simple and effective approach to this challenge. This two‐scale statistical learning framework, consisting of large‐scale screening followed by moderate‐scale variable selection, was introduced in Fan and Lv (2008) and has since been extensively investigated and extended to model settings ranging from parametric to semiparametric and nonparametric for regression, classification, and survival analysis. This article provides an overview of the developments of sure independence screening over the past decade. These developments demonstrate the wide applicability of sure independence screening‐based learning and inference for big data analysis with the desired scalability and theoretical guarantees.
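To make the screening stage concrete, the following is a minimal sketch (not the authors' implementation) of the basic correlation-based screening step for linear regression: each feature is ranked by its absolute marginal correlation with the response, and the top d features are retained before a moderate-scale selection method is applied. The function name `sis_screen` and the default choice d = n / log(n) are illustrative assumptions; the latter is a commonly cited choice in the screening literature.

```python
import numpy as np

def sis_screen(X, y, d=None):
    """Rank features by absolute marginal correlation with y; keep the top d.

    A hypothetical helper illustrating the screening stage. The default
    d = n / log(n) is one common choice of screening size.
    """
    n, p = X.shape
    if d is None:
        d = int(n / np.log(n))
    # Center the data so inner products become covariances
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    # Marginal correlation of each feature with the response
    corrs = (Xc * yc[:, None]).sum(axis=0) / (
        np.sqrt((Xc ** 2).sum(axis=0)) * np.sqrt((yc ** 2).sum()) + 1e-12
    )
    # Indices of the d features with the largest absolute correlation
    return np.argsort(-np.abs(corrs))[:d]

# Toy example: p = 1000 features, but only features 0 and 1 affect y
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 1000))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.standard_normal(200)
keep = sis_screen(X, y)  # the two active features should survive screening
```

The surviving set `keep` would then be passed to a moderate-scale variable selection procedure (e.g., a penalized regression) in the second stage of the two-scale framework.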