This file tracks the datasets used in this course. They will likely move elsewhere eventually, to be stored in a much better way!
Note: to push some of these files, you first need to set the following git options:
git config --global http.postBuffer 500M
git config --global http.maxRequestBuffer 100M
git config --global core.compression 0
Data source: first 100k lines of https://data.illinois.gov/dataset/professional-licensing/resource/beef79ab-b679-4b0a-960a-05c218a619ba pulled with license_query.ipynb
A random sample of 10k rows from this 100k, with first and last names randomized
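The sampling and name randomization described above can be sketched roughly as follows. This is not the code from license_query.ipynb. The function name, the column names `first_name` and `last_name`, and the choice to shuffle names among the sampled rows (rather than replace them with generated names) are all assumptions made for illustration.

```python
import random


def sample_and_shuffle_names(rows, k, seed=0,
                             first_col="first_name", last_col="last_name"):
    """Return k randomly chosen rows with the name columns shuffled across rows.

    `rows` is a list of dicts (e.g. from csv.DictReader). The column names
    are hypothetical; adjust them to match the real file.
    """
    rng = random.Random(seed)
    # Draw k rows without replacement; copy so the input is left untouched.
    sample = [dict(row) for row in rng.sample(rows, k)]

    # Shuffle each name column independently so that names no longer
    # line up with the rest of the record.
    for col in (first_col, last_col):
        values = [row[col] for row in sample]
        rng.shuffle(values)
        for row, value in zip(sample, values):
            row[col] = value
    return sample
```

Fixing the seed keeps the sample reproducible between runs, which helps when the same subset needs to be regenerated for a later semester.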
Data source: https://data.world/timothyrenner/bfro-sightings-data
Classification system described here: https://www.bfro.net/gdb/classify.asp
Data source: https://data.illinois.gov/dataset/87building_inventory
Link: https://github.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/raw/main/data/michigan_lld.flt
Data source: Measurements taken from around Lake Michigan (https://www.ngdc.noaa.gov/mgg/greatlakes/michigan.html)
Data source: ?
This is a subset of the UFO dataset.
Link: https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_data/main/ufo-subset-spring2023.csv
Data source: ?
Link: https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/GDP.csv
Data source: ?
Link: https://github.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/raw/main/data/stitch_reworked.png
Data source: ? (I think a screenshot by Matt?)
The processing of this image into 3 colors is in the prep notebook for the first week, as an "aside".
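For readers without the prep notebook at hand, one common way to reduce an image to 3 colors is to snap every pixel to the nearest color in a fixed palette. The sketch below is only an illustration of that idea, not the notebook's actual method; the function name and the palette values are assumptions.

```python
def quantize_to_palette(pixels, palette):
    """Map every RGB pixel to the nearest color in `palette`.

    `pixels` is a list of rows, each row a list of (r, g, b) tuples.
    """
    def nearest(pixel):
        # Nearest palette color by squared Euclidean distance in RGB space.
        return min(palette,
                   key=lambda c: sum((a - b) ** 2 for a, b in zip(pixel, c)))

    return [[nearest(p) for p in row] for row in pixels]


# Hypothetical 3-color palette: black, mid-gray, white.
PALETTE = [(0, 0, 0), (128, 128, 128), (255, 255, 255)]
```

A real image would first be loaded into this row-of-tuples form (or a NumPy array, with the same logic vectorized) before quantizing.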
Link: https://github.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/raw/main/data/littleCorgiInHat.png
Data source: ? google?
Link: https://github.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/raw/main/data/single_dicom.h5
Data source: Brain scan of a colleague
Data Source: ?
Links:
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/data_tohoku_norm_transpose.csv
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/location.txt
Data Source: ? (it's in one of the notebooks)
Data Source: https://www.ers.usda.gov/data-products/state-export-data/
Link: https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/mobility.csv
Data Source: A dataset of USA "mobility" which (I think) comes from a large census study from 1989-2015 and is collected in several places including right here. Here "mobility" refers to how easy it is for a person to move up in economic status (more info can be found here), based on factors like parental income, location, race, etc.
Links:
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/corgs_per_country_over_time_columns_2020.csv (Corgis born per country over time)
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/corgiData_countries_subset_2020.json (Subset of full Corgi database)
Data Source: This dataset was scraped from the Cardigan Archives using Beautiful Soup and further processed in Python into this form.
Link: https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/LakeHuron.csv
Data Source: ? (Idyll somewhere...)
Link: https://github.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/raw/main/data/galaxyFiles.zip
Data Source: Downsampled data from https://ui.adsabs.harvard.edu/abs/2011MNRAS.412.1341D/abstract and https://ui.adsabs.harvard.edu/abs/2012MNRAS.420.2221D/abstract
Link: http://yt-project.org/data/IsolatedGalaxy.tar.gz
Data Source: yt data hub
Link: https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/othello.txt
Data Source: https://www.gutenberg.org/files/1531/1531-h/1531-h.htm
Links:
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/facebook_combined_sm000030_000000.txt (One major node facebook data)
- https://raw.githubusercontent.com/UIUC-iSchool-DataViz/is445_bcubcg_fall2022/main/data/facebook_combined_sm000090_000010.txt (Several small nodes facebook data)
Data Source: SNAP facebook dataset - https://snap.stanford.edu/data/egonets-Facebook.html
- original full network: https://snap.stanford.edu/data/facebook_combined.txt.gz
- see notebook in last day of course for ideas of how to transform the original data into this format
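As a rough idea of what such a transformation might look like, the sketch below keeps only the edges among one chosen node and its direct neighbors. It is not the course notebook's code: the function name is hypothetical, and it assumes the original file is a plain edge list with one whitespace-separated pair of integer node IDs per line.

```python
def ego_subgraph(edge_lines, center):
    """Keep only edges among `center` and its direct neighbors.

    `edge_lines` is an iterable of "u v" strings (assumed edge-list layout).
    """
    # Parse non-empty lines into (u, v) integer pairs.
    edges = [tuple(int(x) for x in line.split())
             for line in edge_lines if line.strip()]

    # Collect the center node plus every node sharing an edge with it.
    keep = {center}
    for u, v in edges:
        if u == center:
            keep.add(v)
        elif v == center:
            keep.add(u)

    # Retain edges whose endpoints are both in the kept set.
    return [(u, v) for u, v in edges if u in keep and v in keep]
```

For example, `ego_subgraph(open("facebook_combined.txt"), 0)` would return the neighborhood of node 0, assuming the decompressed file is in the working directory.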
Data source: http://exoplanetarchive.ipac.caltech.edu
- downloaded: Mon Jun 22 10:10:17 2020
See the geo_codes directory for more info about these.