Skip to content

Latest commit

 

History

History
21 lines (14 loc) · 1.63 KB

README.md

File metadata and controls

21 lines (14 loc) · 1.63 KB

Analyzing COVID-19 Impact on NYC Public Transit

View website for a more comprehensive analysis and breakdown of the project.

Overview

The purpose of this project was to try to determine how the Covid-19 pandemic impacted subway ridership in NYC, and if the vaccination efforts were helping to bring the ridership numbers to pre-pandemic levels. My hypothesis was that the pandemic affected subway ridership in a negative way, however that the vaccination efforts would help bring it to what the MTA would consider normal levels. I gathered vaccination and coronavirus cases data and daily ridership number from the MTA website. I then cleaned the data because the method of data collection varied depending on which agency it was collected from. I then applied linear regression and correlation and plotted those as well. I also created preliminary plots to showcase the data to understand how Vaccinations and Cases played in Daily Ridership from the start of the pandemic. Finally I used the SARIMA model to predict how ridership numbers would change in the next 6 months.

These techniques/methods were accomplished using the following libraries: Matplotlib, SciPy, seaborn, Pandas, and Scikit-Learn.

Visualizations

Cases and NYC Ridership: Cases and NYC Ridership

Vaccines and NYC Ridership: Vaccines and NYC Ridership

Fitting the SARIMA model to the data: SARIMA Model fit

The final SARIMA model forecasting: SARIMA Forecasting