2009
DOI: 10.1111/j.1467-985x.2008.00574.x
|View full text |Cite
|
Sign up to set email alerts
|

Multiple Imputation for Combining Confidential Data Owned by Two Agencies

Abstract: Statistical agencies that own different databases on overlapping subjects can benefit greatly from combining their data. These benefits are passed on to secondary data analysts when the combined data are disseminated to the public. Sometimes combining data across agencies or sharing these data with the public is not possible: one or both of these actions may break promises of confidentiality that have been given to data subjects. We describe an approach that is based on two stages of multiple imputation that f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 29 publications
0
10
0
Order By: Relevance
“…This approach can be adapted for more general add‐on scenarios. The approach was developed by Kohnen (2005) and illustrated by Kohnen & Reiter (2009). We assume no missing data in D 1 or D 2 ; methods for handling missing data and integration simultaneously are a subject for future research.…”
Section: Synthetic Data For Integration and Disseminationmentioning
confidence: 99%
See 1 more Smart Citation
“…This approach can be adapted for more general add‐on scenarios. The approach was developed by Kohnen (2005) and illustrated by Kohnen & Reiter (2009). We assume no missing data in D 1 or D 2 ; methods for handling missing data and integration simultaneously are a subject for future research.…”
Section: Synthetic Data For Integration and Disseminationmentioning
confidence: 99%
“…Or, Agency 1 might transform the variables with sensitive data to standard normal distributions, e.g. via Box–Cox transformations; see Kohnen & Reiter (2009) for an application of this approach. A third masking approach was described by Kohnen (2005).…”
Section: Synthetic Data For Integration and Disseminationmentioning
confidence: 99%
“…Recent developments in advanced disclosure avoidance techniques for microdata include noise infusion (Organisation for Economic Co‐operation and Development, ) and synthetic data (Machanavajjhala et al ., ; Reiter, ; Kohnen and Reiter, ). The first large‐scale use of noise infusion in any official statistical product occurred in 2003 on the US Census Bureau's QWIs (Abowd et al ., ).…”
Section: Introductionmentioning
confidence: 99%
“…Among the major limitations of this approach are that it relies on SRL, allows only datacustodians to analyse the microdata (i.e., non-data custodians cannot perform analysis) and that it is currently limited to a certain set of models. Alternatively, Kohnen and Reiter (2009) consider the novel problem of how data custodians, without sharing sensitive variables, can together produce synthetic linked microdata for public use. Limitations of this approach are that synthetic data can be time consuming to produce and that it can be hard to guarantee that the synthetic data do not distort some important relationships.…”
Section: Introductionmentioning
confidence: 99%