2020
DOI: 10.2478/jos-2020-0004
|View full text |Cite
|
Sign up to set email alerts
|

Statistical Challenges in Combining Survey and Auxiliary Data to Produce Official Statistics

Abstract: Combining survey and auxiliary data to produce official statistics is gaining interest at federal agencies and among policy makers due to its efficiency. Recent studies have shown the practicality of small area estimation modeling approaches in the context of integrating data from multiple sources to improve estimation at fine levels of aggregation. In this article, agricultural predictions are constructed using a hierarchical Bayes subarea-level model, fit to data available from different sources. Auxiliary d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7
1

Relationship

4
4

Authors

Journals

citations
Cited by 13 publications
(18 citation statements)
references
References 12 publications
0
18
0
Order By: Relevance
“…For future investigations, we recommend exploring even finer domains (for example, with geography defined by states, counties, or metropolitan areas) and other data sources to help refine the prediction space (see, e.g., Erciulescu et al, 2020). The predictions could also be improved if good quality auxiliary data could be obtained and harmonised with the NCS survey data, to construct additional covariates for the small domain models; any such data would have to be available and allowed to be used in the production environment of the agency.…”
Section: Discussionmentioning
confidence: 99%
“…For future investigations, we recommend exploring even finer domains (for example, with geography defined by states, counties, or metropolitan areas) and other data sources to help refine the prediction space (see, e.g., Erciulescu et al, 2020). The predictions could also be improved if good quality auxiliary data could be obtained and harmonised with the NCS survey data, to construct additional covariates for the small domain models; any such data would have to be available and allowed to be used in the production environment of the agency.…”
Section: Discussionmentioning
confidence: 99%
“…In some cases, the survey may not indicate any planted area with respect to a particular commodity, but administrative data might represent some positive acres or vice versa. Erciulescu et al (2020) used the nearest neighbor methods to impute missing data for either survey or covariates. This approach of imputing and borrowing information from previous year or the average of several years estimates are being explored.…”
Section: Discussionmentioning
confidence: 99%
“…In this paper, models with constraints are considered based on Nandram et al (2022). For comparison, the model without constraints are from Erciulescu et al (2020).…”
Section: Modelsmentioning
confidence: 99%
“…Stats 2022, 5 and can be applied to surveys suffering from small area issues, such as unavailable or unreliable survey summaries due to the volatility of sampling variances associated with the survey design. For ease of understanding, we illustrate the two methods with data from the United States Department of Agriculture's (USDA's) National Agricultural Statistics Service (NASS) crops county estimates program.…”
Section: Introductionmentioning
confidence: 99%
“…Yield is defined as the ratio of total production to the harvested acreage. Starting in 2020, several HB subarea-level (small area) models have been implemented as extensions of the FH model to improve the precision of the estimates at the county level (see [2][3][4][5][6]). The sampling variances of the yield estimates are produced using a second-order Taylor series approximation and, due to various reasons (e.g., sparseness in data), could result in zero, very small or very large estimated variances for several counties.…”
Section: Introductionmentioning
confidence: 99%