The Impact of the U.S. Census Disclosure Avoidance System on Redistricting and Voting Rights Analysis

  title={The Impact of the U.S. Census Disclosure Avoidance System on Redistricting and Voting Rights Analysis},
  author={Christopher T. Kenny and Shiro Kuriwaki and Cory McCartan and Evan T R Rosenman and Tyler Simko and Kosuke Imai},
The US Census Bureau plans to protect the privacy of 2020 Census respondents through its Disclosure Avoidance System (DAS), which attempts to achieve differential privacy guarantees by adding noise to the Census microdata. By applying redistricting simulation and analysis methods to DAS-protected 2010 Census data, we find that the protected data are not of sufficient quality for redistricting purposes. We demonstrate that the injected noise makes it impossible for states to accurately comply… 
On Heuristic Models, Assumptions, and Parameters
Six reasons why heuristic models, assumptions, and parameters may be hazardous to comprehensive analysis of computing are raised and argued they deserve deliberate consideration as researchers explain scientific work.
Balancing data privacy and usability in the federal statistical system.
This essay argues that the discussion of federal statistical system change has not given proper consideration to the reduced social benefits of data availability and their usability relative to the value of increased levels of privacy protection, and recommends that a more balanced benefit-cost framework should be used to assess these trade-offs.
Statistical Data Privacy: A Song of Privacy and Utility
The statistical foundations common to both SDC and DP are discussed, major developments in SDP are highlighted, and exciting open research problems in private inference are presented.
Differential Perspectives: Epistemic Disconnects Surrounding the U.S. Census Bureau’s Use of Differential Privacy
When the U.S. Census Bureau announced its intention to modernize its disclosure avoidance procedures for the 2020 Census, it sparked a controversy that is still underway. The move to differential
Differential Privacy and Swapping: Examining De-Identification's Impact on Minority Representation and Privacy Preservation in the U.S. Census
It is proved that the expected error of queries made on swapped demographic datasets is greater in sub-populations whose racial distributions differ more from the racial distribution of the global population, and that the probability that m unique entries exist in a sub-population shrinks exponentially as the sub- population size grows.
Assessing Statistical Disclosure Risk for Differentially Private, Hierarchical Count Data, with Application to the 2020 U.S. Decennial Census
We propose Bayesian methods to assess the statistical disclosure risk of data released under zero-concentrated differential privacy, focusing on settings with a strong hierarchical structure and


Creating open source composite geocoders: Pitfalls and opportunities
A suite of open‐source tools designed to standardize American addresses and geocode them are presented, culminating in a composite geocoder for the city of St. Louis, Missouri, which serves as a proof‐of‐concept for developing sophisticated geocoding processes.
censusxy: Access the U.S
  • Census Bureau’s Geocoding A.P.I. System,
  • 2021
Census TopDown: The Impacts of Differential Privacy on Redistricting
Based on a close look at reconstructed Texas data, reassuring evidence is found that TopDown will not threaten the ability to produce districts with tolerable population balance or to detect signals of racial polarization for Voting Rights Act enforcement.
Disclosure Avoidance in the Census Bureau's 2010 Demonstration Data Product
The differentially private Disclosure Avoidance System (DAS) used to prepare the 2010 Demonstration Data Product (DDP) is described and the policy decisions that underlie the DAS are described and how theDAS uses those policy decisions to produce differentiallyPrivate data.
Multi-Scale Merge-Split Markov Chain Monte Carlo for Redistricting
In this work, the state space is extended so that each district is defined by a hierarchy of trees, which improves the computational efficiency of the multi-scale algorithm.
Sequential Monte Carlo for Sampling Balanced and Compact Redistricting Plans
A new Sequential Monte Carlo (SMC) algorithm is presented that draws representative redistricting plans from a realistic target distribution of choice and can simultaneously incorporate several constraints commonly imposed in real-world redistricting problems, including equal population, compactness, and preservation of administrative boundaries.
How differential privacy will affect our understanding of health disparities in the United States
It is found that the implementation of differential privacy will produce dramatic changes in population counts for racial/ethnic minorities in small areas and less urban settings, significantly altering knowledge about health disparities in mortality.
Recombination: A family of Markov chains for redistricting
This paper sets up redistricting as a graph partition problem and introduces a new family of Markov chains called Recombination (or ReCom) on the space of graph partitions and presents evidence that ReCom mixes efficiently, especially in contrast to the slow-mixing Flip, and provides experiments that demonstrate its qualitative behavior.
A Merge-Split Proposal for Reversible Monte Carlo Markov Chain Sampling of Redistricting Plans
A Markov chain on redistricting plans that makes relatively global moves is described, designed to be usable as the proposal in a Markov Chain Monte Carlo (MCMC) algorithm.