2023
DOI: 10.1007/s11113-023-09829-4
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the Impact of Differential Privacy on Population Uniques in Geographically Aggregated Data: The Case of the 2020 U.S. Census

Yue Lin,
Ningchuan Xiao
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 31 publications
0
7
0
Order By: Relevance
“…This can facilitate precise estimation and analysis of population patterns, trends, and changes at the local level [Lomax and Smith, 2017;Wu et al, 2022]. Such data can also be used to empower policymakers and planners to simulate and evaluate the impact of various policies, interventions, or scenarios on individual behavior and movement within urban or regional contexts [He et al, 2020;Lin and Xiao, 2023a;Papyshev and Yarime, 2021;Tanton et al, 2009], thus supporting applications in domains such as public health [Grefenstette et al, 2013;Spooner et al, 2021] and transportation planning [Hörl andBalac, 2021, Zhu andFerreira, 2014]. In addition, there is existing literature on enhancing synthetic population data by incorporating census variables with external data sources such as health and commercial surveys [Spooner et al, 2021;Morrissey et al, 2015].…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…This can facilitate precise estimation and analysis of population patterns, trends, and changes at the local level [Lomax and Smith, 2017;Wu et al, 2022]. Such data can also be used to empower policymakers and planners to simulate and evaluate the impact of various policies, interventions, or scenarios on individual behavior and movement within urban or regional contexts [He et al, 2020;Lin and Xiao, 2023a;Papyshev and Yarime, 2021;Tanton et al, 2009], thus supporting applications in domains such as public health [Grefenstette et al, 2013;Spooner et al, 2021] and transportation planning [Hörl andBalac, 2021, Zhu andFerreira, 2014]. In addition, there is existing literature on enhancing synthetic population data by incorporating census variables with external data sources such as health and commercial surveys [Spooner et al, 2021;Morrissey et al, 2015].…”
Section: Discussionmentioning
confidence: 99%
“…An optimization approach [Lin and Xiao, 2022; Lin and Xiao, 2023b; Lin, 2023] is used to construct the synthetic population data. We begin with a matrix representation of the individual-level population data that need to be synthesized.…”
Section: Optimization Modelingmentioning
confidence: 99%
See 1 more Smart Citation
“…Differential privacy is quite effective against the risk of reconstruction-abetted disclosure but the methodology is not fool proof. For example, Lin and Xiao (2023) have shown that the method, in this case the TopDown Algorithm (TDA) used by the U.S. Census Bureau, does not necessarily prevent the identification of population uniques using the public census tables, such as an individual belonging to a cell in a table that has a value of one. In particular, the probability of disclosure depends upon the size of the population, its composition and the number of attributes taken into account.…”
Section: Many Problems and One Solutionmentioning
confidence: 99%
“…In line with the Census Bureau's commitment to promoting public participation and ensuring “everyone counts,” measures have been taken to protect individual privacy by removing personal identifiers such as names and addresses from data releases. However, even without these identifiers, privacy concerns can still arise because the data contains information such as locations and demographic attributes that can be used to disclose the identities of individuals (Sweeney 2000; Lin and Harvey 2015; Lin and Xiao 2023a). An illustrative case is found in the research conducted by Abowd and Hawes (2023) on the 2010 U.S. Census data, which demonstrate that using a combination of census block, sex, and single year of age can differentiate a person from others in 44% of the national population.…”
Section: Introductionmentioning
confidence: 99%