Skip to content

A script that scrapes information from airline travel waiver emails from Microsoft Outlook to create a structured data set of historical events that have impacted travel.

Notifications You must be signed in to change notification settings

eli64s/python_outlook_email_parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 

Repository files navigation

Python Microsoft Outlook Email Scraper

This script scrapes unstructured Microsoft Outlook emails regarding Travel Waivers (example image below) that United Airlines issues when travel plans may be impeded by events like severe weather/epidemics/etc.

The email data is cleaned and manipulated into a DataFrame that includes the Travel Waiver's issued date, event name, category, severity level, cities impacted, and dates the waiver is issued for.

The code can be broken down into the following 3 sections:

1. Data mine Outlook to collect all Travel Waiver emails
2. Create classes/functions to parse email data
3. Iterate over the data set to clean and structure it

Over time, the collection of this data may yield some interesting insights into events that impact travel. Feel free to ask questions or leave suggestions/critique, I love to learn other's approaches to problems!

TravelWaiverImage

About

A script that scrapes information from airline travel waiver emails from Microsoft Outlook to create a structured data set of historical events that have impacted travel.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published