Local Socrata ETL development #1252
Conversation
@tillyw The staging ETL overwrote the micromobility data in the test datasets over the weekend, so I replaced the data again and turned off the Airflow DAG so that it will stick around this time.
I see the Ys in Socrata, ✅
Looks good, I see Y's! Just to be sure, there should only be the option to filter by "Y", right?
@roseeichelmann Yes, Socrata creates those filter options based on the content of the dataset. The "Y" values are proof that something that is not in the production dataset can be pushed to the test dataset to develop ETL/VZV updates, and the values could change in the future depending on where the rest of Tilly's work goes. 🚀 🙌 Going to go ahead and merge this. Thanks y'all!
Associated issues
This is purely to unblock local development of the Socrata ETL. Thanks to @tillyw and @frankhereford for the discussion on this. I realized last night that I had started exploring this a while back but forgot. The biggest update here is a new environment file in 1Password that gives the ETL the details it needs to point at the local VZ DB so that it can populate the test datasets. Since the data source is still production data, the test datasets look production-like, and this setup also makes it possible to test schema changes.
This should hold us over until we migrate this to Airflow v2 and refactor it a little.
Testing
URL to test:
n/a
Steps to test:
You can check that this worked by going to https://data.austintexas.gov/Transportation-and-Mobility/-UNDER-CONSTRUCTION-TEST-Crash-Report-Data/3aut-fhzp and filtering the dataset to micromobility_fl = Y, or run the ETL locally:
$ runetl ~/.ssh/atd-etl/etl.local.env app/process_socrata_export.py
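The filter check above could also be scripted. This is only a minimal sketch, and it makes assumptions that are not part of this PR: that the test dataset is exposed through Socrata's standard SODA resource endpoint (built here from the domain and dataset ID in the URL above), and that it can be read without credentials. The helper names `build_count_url` and `count_flagged_rows` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Assumption: standard SODA resource endpoint for the test dataset,
# derived from the domain and dataset ID (3aut-fhzp) in the URL above.
RESOURCE_URL = "https://data.austintexas.gov/resource/3aut-fhzp.json"


def build_count_url(column: str = "micromobility_fl", value: str = "Y") -> str:
    """Build a SoQL query URL that counts rows where column equals value.

    Hypothetical helper. The value is interpolated as-is, so only pass
    simple trusted strings such as "Y".
    """
    params = {
        "$select": "count(*) AS row_count",
        "$where": f"{column} = '{value}'",
    }
    return f"{RESOURCE_URL}?{urlencode(params)}"


def count_flagged_rows(url: str) -> int:
    """Fetch the query URL and return the row count it reports.

    Assumes the dataset is readable without an app token or login,
    which may not hold for an unpublished test dataset.
    """
    with urllib.request.urlopen(url, timeout=30) as response:
        rows = json.load(response)
    return int(rows[0]["row_count"])
```

Calling `count_flagged_rows(build_count_url())` after a local ETL run should return a nonzero count if the `micromobility_fl = Y` rows made it into the test dataset.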
Ship list