2019
DOI: 10.1177/2378023118817378
|View full text |Cite
|
Sign up to set email alerts
|

Improving Metadata Infrastructure for Complex Surveys: Insights from the Fragile Families Challenge

Abstract: Researchers rely on metadata systems to prepare data for analysis. As the complexity of data sets increases and the breadth of data analysis practices grow, existing metadata systems can limit the efficiency and quality of data preparation. This article describes the redesign of a metadata system supporting the Fragile Families and Child Wellbeing Study on the basis of the experiences of participants in the Fragile Families Challenge. The authors demonstrate how treating metadata as data (i.e., releasing compr… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
8
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 34 publications
0
8
0
Order By: Relevance
“…It is important that the co-analysts understand not just the available dataset but also any ancillary information that might affect their analyses (e.g., prior exclusion of outliers or handling of missing data in the blinded dataset). Providing a codebook that is accessible and understandable for researchers with different backgrounds is essential ( Kindel et al, 2019 ).…”
Section: Multi-analyst Guidancementioning
confidence: 99%
“…It is important that the co-analysts understand not just the available dataset but also any ancillary information that might affect their analyses (e.g., prior exclusion of outliers or handling of missing data in the blinded dataset). Providing a codebook that is accessible and understandable for researchers with different backgrounds is essential ( Kindel et al, 2019 ).…”
Section: Multi-analyst Guidancementioning
confidence: 99%
“…It is important that the co-analysts understand not just the available dataset but also any ancillary information that might affect their analyses (e.g., prior exclusion of outliers or handling of missing data in the blinded dataset). Providing a codebook that is accessible and understandable for researchers with different backgrounds is essential (25).…”
Section: Providing the Datasetmentioning
confidence: 99%
“…They refer to their approach as "treating metadata as data" and suggest that the analysis of complex survey designs can be simplified if variable names are given a consistent, regular expressioncompatible nomenclature and are placed in a data set of metadata. They expect that analysts will use the data set of metadata to help them include many variables in a predictive model in a way that respects the structure of the data.Although the approach Kindel et al (2019) suggest is sound, it does not go far enough. The "metadata as data" approach builds the information from the survey's codebook-namely, response type, survey wave, and what type of respondent answered the question-into the data, but it does not encode tacit knowledge about the data set.…”
mentioning
confidence: 95%
“…Although the approach Kindel et al (2019) suggest is sound, it does not go far enough. The "metadata as data" approach builds the information from the survey's codebook-namely, response type, survey wave, and what type of respondent answered the question-into the data, but it does not encode tacit knowledge about the data set.…”
mentioning
confidence: 95%
See 1 more Smart Citation