Listing Thumbnail

    2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File

     Info
    Open data
    |
    Deployed on AWS
    The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File (NMF) is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022] https://doi.org/10.1162/99608f92.529e3cb9, and implemented in the [DAS 2020 Redistricting Production Code](https://github.com/uscensusbureau/DAS_2020_Redistricting_Production_Code)). The NMF was generated using [the Census Bureau's implementation](https://github.com/uscensusbureau/DAS_2020_Redistricting_Production_Code/blob/289ee463936a6f0efcf2e378abe410ec01d0e140/source/programs/engine/primitives.py#L183) of the [Discrete Gaussian Mechanism](https://arxiv.org/abs/2004.00010), calibrated to satisfy [zero-Concentrated Differential Privacy](https://arxiv.org/abs/1605.02065) with [bounded neighbors](https://dl.acm.org/doi/10.1145/1989323.1989345). <br/> <br/> The NMF values, called **noisy measurements** are the output of applying the Discrete Gaussian Mechanism to[...]

    Overview

    The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File (NMF) is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022] https://doi.org/10.1162/99608f92.529e3cb9 , and implemented in the DAS 2020 Redistricting Production Code ). The NMF was generated using the Census Bureau's implementation  of the Discrete Gaussian Mechanism , calibrated to satisfy zero-Concentrated Differential Privacy  with bounded neighbors .

    The NMF values, called noisy measurements are the output of applying the Discrete Gaussian Mechanism to counts from the 2020 Census Edited File (CEF). They are generally inconsistent with one another (for example, in a county composed of two tracts, the noisy measurement for the county's total population may not equal the sum of the noisy measurements of the two tracts' total population), and frequently negative (especially when the population being measured was small), but are integer-valued. The NMF was later post-processed as part of the DAS code to take the form of microdata and to satisfy various constraints. The NMF documented here contains both the noisy measurements themselves as well as the data needed to represent the DAS constraints; thus, the NMF could be used to reproduce the steps taken by the DAS code to produce microdata from the noisy measurements by applying the production code base .

    The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File includes zero-Concentrated Differentially Private (zCDP) (Bun, M. and Steinke, T [2016]) noisy measurements, implemented via the discrete Gaussian mechanism. These are estimated counts of individuals and housing units included in the 2020 Census Edited File (CEF), which includes confidential data initially collected in the 2020 Census of Population and Housing. The noisy measurements included in this file were subsequently post-processed by the TopDown Algorithm (TDA) to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File .

    The NMF provides estimates of counts of persons in the CEF by various characteristics and combinations of characteristics including their reported race and ethnicity, whether they were of voting age, whether they resided in a housing unit or one of 7 group quarters types, and their census block of residence after the addition of discrete Gaussian noise (with the scale parameter determined by the privacy-loss budget allocation for that particular query under zCDP). Noisy measurements of the counts of occupied and vacant housing units by census block are also included. Lastly, data on constraints--information into which no noise was infused by the Disclosure Avoidance System (DAS) and used by the TDA to post-process the noisy measurements into the 2020 Census Redistricting Data (P.L. 94-171) Summary File --are provided.

    Features and programs

    Open Data Sponsorship Program

    This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.

    Pricing

    This is a publicly available data set. No subscription is required.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    AWS Data Exchange (ADX)

    AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

    Open data resources

    Available with or without an AWS account.

    How to use
    To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more 
    Description
    The 2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::uscb-2020-product-releases/decennial/redistricting/2020/nmf/2020-pl94-nmf-state-partitioned
    AWS region
    us-west-2
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://uscb-2020-product-releases/decennial/redistricting/2020/nmf/2020-pl94-nmf-state-partitioned/
    Description
    Census Open Data S3 Inventory
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::uscb-opendata-inventory
    AWS region
    us-west-2
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://uscb-opendata-inventory/

    Resources

    Support

    How to cite

    2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File was accessed on DATE from https://registry.opendata.aws/census-2020-pl94-nmf .

    License

    CC0 1.0 Universal

    Similar products