Skip to content

Latest commit

 

History

History
122 lines (83 loc) · 6.32 KB

README.md

File metadata and controls

122 lines (83 loc) · 6.32 KB

BART Passenger Heatmap

Shiny app for displaying BART hourly origin-destination data from 2011 - June 2021.

Note that there used to be a shinyapps.io deployment example. However, it uses too much space now for the free tier, so it's been taken down. You can run direct from RStudio with the following:

library(shiny)
shiny::runGitHub("bart-passenger-heatmap", "jlperona")

Background

I created this application for a class that I took while in graduate school.

Previous Work

This application is heavily based on my previous work, bart-hourly-dataset-parser, which is licensed under the MIT License. I used the Python script and the Pajek NET file in that repository to build the ones in this repository. The methodology is almost identical to the one in that script, as well.

Reasoning

I wanted to see the BART network mapped with actual GIS data, and with the passenger data that BART provides on their website. One of the ways of doing that was by combining Shiny, which helps me create a web application, with leaflet, which lets me map the GIS data. I'm sure there are other ways of visualizing this data, but just like with bart-hourly-dataset-parser, I made this for a class project. I needed to use R, so this was a reasonable choice.

For true heatmapping purposes, using the actual track data isn't the best since tracks vary in size. This is visible on the current track data. A stylized version, like the BART system map, would probably be better.

Methodology

There are two main pieces to this application:

  • the preparser, which is in preparser/
  • the actual Shiny code, which is in the .R files in the main directory

Preparser

See the README for the preparser for more information on the preparser.

Shiny Application

The Shiny application takes in the preparser output, as mentioned above. It also takes in GIS data in the GeoJSON format that is provided in geojson/. (For an explanation of the GIS data, see the GeoJSON README.) Its output is a heatmap of where the passengers traveled on the network.

The Shiny application does the following:

  1. Load the preparsed data.
  2. Load the station and track GIS data.
  3. Render the base map, the stations, and the sidebar.
  4. Render the tracks when the app is loaded and when the update button is pressed.
    1. Validate the user's selections for the date-time.
    2. Subset the preparsed data by the date-time.
    3. Aggregate the data for each edge.
    4. Color each track based on how many passengers traveled on that edge.
    5. Create a popup containing exactly how many passengers traveled on that edge.

Resources Used

I used a variety of resources to create this application.

Starting Out

This blog post on DataScience.com was amazing for getting off the ground with mapping data using leaflet and Shiny together. If you're looking to get started doing something like this, I highly recommend going through that tutorial and modifying it to suit your needs.

Package Documentation

The references for the following packages proved invaluable to getting this working:

Shiny Examples

Many of the Shiny examples were helpful for demonstrating different aspects of Shiny. In particular, I used the following:

Stack Overflow

Finally, I utilized quite a few posts on StackOverflow to answer some questions that I had. These include the following:

Other References

This page from UC Berkeley was very helpful for working with POSIXct.

Lessons Learned

A few things that I became very familiar with over the course of working on this project:

  • Make sure to check that your strings are imported as characters, and not factors, if that's necessary for your logic. I spent a long time debugging that issue.
  • Make sure you didn't accidentally overwrite your input data. Sometimes you accidentally modify it, even when you didn't mean to.
  • Validating your data is important. I knew that from prior experience, but it was admittedly funny when I crashed the app by selecting no hours to display.
  • Good comments are incredibly helpful when coming back to this project after 2 years.

Future

One idea I had was to create different types of visualizations with this data. Exactly what types of visualizations is a very good question. Maybe in the future I'll consider it further.