2011
DOI: 10.1111/j.1467-9868.2011.01005.x
|View full text |Cite
|
Sign up to set email alerts
|

Variance Estimation Using Refitted Cross-Validation in Ultrahigh Dimensional Regression

Abstract: Summary Variance estimation is a fundamental problem in statistical modelling. In ultrahigh dimensional linear regression where the dimensionality is much larger than the sample size, traditional variance estimation techniques are not applicable. Recent advances in variable selection in ultrahigh dimensional linear regression make this problem accessible. One of the major problems in ultrahigh dimensional regression is the high spurious correlation between the unobserved realized noise and some of the predicto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
229
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 233 publications
(234 citation statements)
references
References 27 publications
5
229
0
Order By: Relevance
“…This decrease in the rate of convergence can be substantial for large p, for example if log p ∝ n γ for some positive γ < 1/2. This condition can be relaxed using the sample-splitting method of Fan, Guo, and Hao (2011), which is done in a supplementary appendix. Condition (v) is simply a set of sufficient conditions for consistent estimation of the variance of the double selection estimator.…”
Section: Condition Aste (P)mentioning
confidence: 99%
“…This decrease in the rate of convergence can be substantial for large p, for example if log p ∝ n γ for some positive γ < 1/2. This condition can be relaxed using the sample-splitting method of Fan, Guo, and Hao (2011), which is done in a supplementary appendix. Condition (v) is simply a set of sufficient conditions for consistent estimation of the variance of the double selection estimator.…”
Section: Condition Aste (P)mentioning
confidence: 99%
“…In particular, Table 1 refers to the case when lags of the predictand are not considered for forecasting using DI and the selection of the PCs is based on the p-values computed on the marginal t * i statistics; Table 2 refers to the case when p lags are considered and the selection is based on the marginal t * i statistics. Tables 3 and 4 deal with the selection based on the t i statistics with σ estimated according to the RCV method by Fan et al (2012): in 3 no lags of the predictands were considered, whereas in 4 they were.…”
Section: Resultsmentioning
confidence: 99%
“…We consider two implementations of the variable selection procedure, the first based on the marginal t * i statistics and the second based on the t i statistics computed using the Fan et al (2012) estimator of the regression error variance.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This rate condition requires the conditional expectations to be sufficiently smooth so that a relatively small number of series terms can be used to approximate them well. As in the case of the IV estimator, this condition can be replaced with the weaker condition that s log(p ∨ n) = o(n) by employing a sample splitting method of Fan, Guo, and Hao (2011). This is done in a companion paper, which also deals with a more general setup, covering non-Gaussian, heteroscedastic disturbances (Belloni, Chernozhukov, and Hansen 2011).…”
Section: Inference On Treatment and Structural Effects Conditional Onmentioning
confidence: 99%