diff --git a/README.md b/README.md index 7d3108b..5666d8f 100644 --- a/README.md +++ b/README.md @@ -13,14 +13,16 @@ Replace '2021' with the current year throughout. 1. Install the GDAL library and python bindings and the unzip tool + ```bash # Ubuntu: sudo apt-get install python3-gdal python3-pip unzip pip3 install -r requirements.txt + ``` - 2. Get the TIGER 2022 data. You will need the EDGES files + 2. Get the TIGER 2023 data. You will need the EDGES files (3,235 zip files, 11GB total). - wget -r ftp://ftp2.census.gov/geo/tiger/TIGER2022/EDGES/ + wget -r ftp://ftp2.census.gov/geo/tiger/TIGER2023/EDGES/ 3. Convert the data into CSV files. Adjust the file paths in the scripts as needed @@ -28,14 +30,14 @@ Replace '2021' with the current year throughout. 4. Maybe: package the created files - tar -czf tiger2022-nominatim-preprocessed.csv.tar.gz tiger + tar -czf tiger2023-nominatim-preprocessed.csv.tar.gz tiger US Postcodes ------------- Addtionally create a `us_postcodes.csv.gz` file with centroid coordinates. - cat output-path/*.csv | ./calculate_postcode_centroids.py | gzip -9 > us_postcodes.csv.gz + cat tiger/*.csv | ./calculate_postcode_centroids.py | gzip -9 > us_postcodes.csv.gz License