Christopher T. Kenny and Cory McCartan
Precinct-level demographic and election data from the 2020 decennial census and the Voting and Election Science Team (VEST), tidied and joined together using 2020 precinct boundaries. Where 2020 precinct boundaries are not available, Census block-level data is provided instead, and where no VEST data is available, only demographic information is provided. Code to generate the data from these sources is included; the entire workflow is open-source and reproducible.
The easiest way to get the data is to download it from our website. You can also download a ZIP of all the data here.
However, if you want to work with a specific set of states, or wish to join the data to a precinct shapefile, you can use the `alarmdata` package.
Please make sure to cite the Voting and Election Science Team (CC-4.0) and the U.S. Census Bureau. Consult the license for information on modifying and sharing the data and/or code.
- For redistricting and voting rights analysis, we recommend the `redist` package.
- For pre-processing and tidying data for redistricting analysis, we recommend the `geomander` package.
- For more custom tabulations of the 2020 census data, we recommend the `PL94171` package.
- For general-purpose census data processing, we recommend the `censable` package.
- For alternate data unaffected by Census differential privacy, you may want to consider FCC block-level estimates, available using the `blockpop` package.
Each data table contains several identification columns, a set of census-derived demographic columns, and a set of VEST-derived election columns.
- `GEOID20` is the unique identifier for a precinct or Census block. The `state` and `county` of the precinct or block are also provided.
- Census variables are prefixed with `pop_` or `vap_`, depending on whether they are for the entire population or the voting-age population. Suffixes refer to racial and ethnic categories, as follows:
  - `_hisp`: Hispanic or Latino (of any race)
  - `_white`: White alone, not Hispanic or Latino
  - `_black`: Black or African American alone, not Hispanic or Latino
  - `_aian`: American Indian and Alaska Native alone, not Hispanic or Latino
  - `_asian`: Asian alone, not Hispanic or Latino
  - `_nhpi`: Native Hawaiian and Other Pacific Islander alone, not Hispanic or Latino
  - `_other`: Some Other Race alone, not Hispanic or Latino
  - `_two`: Population of two or more races, not Hispanic or Latino
- Election variables consist of average vote counts for Democratic and Republican candidates. The `adv_##` and `arv_##` columns report the average vote count in the `##` year election, across all statewide races contested by both parties. The `ndv` and `nrv` columns further average the vote counts across all available election years. For specific statewide races, you may download the files in `vest-2020/` and join them to the data using the `GEOID20` column. Additional election data is provided with the following naming convention: `off_yr_par_can`, where:
  - `off` indicates the three-letter office abbreviation. Possible choices are:
    - `pre`: President
    - `uss`: United States Senate
    - `gov`: Governor
    - `atg`: Attorney General
    - `sos`: Secretary of State
  - `yr` indicates the year of the election
  - `par` indicates the party:
    - `rep`: Republican
    - `dem`: Democratic
  - `can` indicates the first three letters of the candidate's last name
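The `off_yr_par_can` convention can be parsed mechanically. The following Python sketch is illustrative only and is not part of the project's own code: it assumes a two-digit year and a lowercase three-letter candidate code, and it assumes the per-year averages are simple means over races contested by both parties. The function names, the sample row, and the `aaa`/`bbb`/`ccc` candidate codes are hypothetical.

```python
import re
from collections import defaultdict

# Office codes listed in the naming convention above.
OFFICES = {"pre", "uss", "gov", "atg", "sos"}

# Assumed shape: office, two-digit year, party, three-letter candidate code.
COLUMN_RE = re.compile(r"^([a-z]{3})_(\d{2})_(dem|rep)_([a-z]{3})$")


def parse_election_column(name):
    """Split an off_yr_par_can column name into its parts, or return None."""
    match = COLUMN_RE.match(name)
    if match is None or match.group(1) not in OFFICES:
        return None
    office, year, party, candidate = match.groups()
    return {"office": office, "year": year, "party": party, "candidate": candidate}


def average_party_votes(row, year):
    """Mean Democratic and Republican votes in `year`, using only races
    that have both a Democratic and a Republican column.

    Simplification: races are keyed by office, so two races for the same
    office in the same year would collide.
    """
    races = defaultdict(dict)  # office -> {party: votes}
    for column, votes in row.items():
        parts = parse_election_column(column)
        if parts is not None and parts["year"] == year:
            races[parts["office"]][parts["party"]] = votes
    contested = [r for r in races.values() if "dem" in r and "rep" in r]
    if not contested:
        return None
    n = len(contested)
    return {
        "dem": sum(r["dem"] for r in contested) / n,
        "rep": sum(r["rep"] for r in contested) / n,
    }


# Hypothetical single-precinct row; the vote counts are made up.
row = {
    "GEOID20": "hypothetical-id",
    "pop_white": 400,
    "pre_20_dem_bid": 120,
    "pre_20_rep_tru": 80,
    "uss_20_dem_aaa": 100,
    "uss_20_rep_bbb": 90,
    "gov_20_dem_ccc": 50,  # no Republican column, so this race is skipped
}

print(parse_election_column("pre_20_dem_bid"))
print(average_party_votes(row, "20"))
```

Identification and census columns such as `GEOID20` and `pop_white` do not match the pattern, so they are ignored when averaging.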
To produce election data using 2020 precinct boundaries, election results were projected down to the 2010 block level using voting-age population as weights. Results for 2020 blocks were then estimated using 2010 blocks and the land-use-based crosswalk files from VEST. Finally, 2020 blocks were aggregated to 2020 Census VTDs using the Census Bureau's 2020 block assignment files.
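The three steps can be sketched on a toy example. This Python sketch is an illustration under stated assumptions, not the project's actual pipeline: the function names, the identifiers (`P1`, `A`, `B`, `X`, `Y`, `V1`, `V2`), and the numbers are all hypothetical, and the crosswalk is assumed to give, for each 2010 block, weights that sum to 1 across the 2020 blocks it maps to.

```python
from collections import defaultdict


def disaggregate(precinct_votes, block_precinct, block_vap):
    """Split each precinct's votes among its 2010 blocks in proportion to block VAP."""
    precinct_vap = defaultdict(float)
    for block, precinct in block_precinct.items():
        precinct_vap[precinct] += block_vap[block]
    block_votes = {}
    for block, precinct in block_precinct.items():
        total = precinct_vap[precinct]
        # A precinct with zero total VAP gets no votes assigned in this sketch.
        share = block_vap[block] / total if total > 0 else 0.0
        block_votes[block] = precinct_votes[precinct] * share
    return block_votes


def apply_crosswalk(votes_2010, weights):
    """Move 2010 block estimates onto 2020 blocks.

    `weights` is a list of (block_2010, block_2020, weight) tuples.
    """
    votes_2020 = defaultdict(float)
    for b10, b20, weight in weights:
        votes_2020[b20] += votes_2010[b10] * weight
    return dict(votes_2020)


def aggregate(block_votes, block_vtd):
    """Sum 2020 block estimates up to their assigned VTDs."""
    vtd_votes = defaultdict(float)
    for block, votes in block_votes.items():
        vtd_votes[block_vtd[block]] += votes
    return dict(vtd_votes)


# Toy inputs (all hypothetical).
precinct_votes = {"P1": 100.0}
block_precinct = {"A": "P1", "B": "P1"}           # 2010 block -> precinct
block_vap = {"A": 30.0, "B": 10.0}                # 2010 block VAP
weights = [("A", "X", 1.0), ("B", "X", 0.5), ("B", "Y", 0.5)]
block_vtd = {"X": "V1", "Y": "V2"}                # 2020 block -> 2020 VTD

votes_2010 = disaggregate(precinct_votes, block_precinct, block_vap)
votes_2020 = apply_crosswalk(votes_2010, weights)
vtd_votes = aggregate(votes_2020, block_vtd)
print(vtd_votes)  # {'V1': 87.5, 'V2': 12.5}
```

Because each step only redistributes or sums counts, the total across VTDs equals the original precinct total whenever every precinct has positive voting-age population and the crosswalk weights sum to 1.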