SDAIA Bootcamp project 2 - web scraping/linear regression.
This project predicts ticket prices for upcoming flights to help customers choose the best time to travel and the cheapest flight to their destination. A random forest regression model forecasts flight prices from data scraped from Kayak.
The project proposal can be found here.
The project MVP can be found here.
The Kayak Scraper Notebook can be found here.
The scraped data can be found here.
The project notebook can be found here.
- Source (4 sources were selected for this project)
- Destination (4 destinations were selected for this project)
- Total Stops
- Average Price per Airline
- Duration
- Price (Target)
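
Of the features listed above, "Average Price per Airline" is the only one derived from other columns. The snippet below is a minimal sketch of how such a feature could be computed; it is not taken from the project notebook. The row layout, the key names (`airline`, `total_stops`, `duration_min`, `price`), the helper `add_average_price_per_airline`, and the sample values are all hypothetical and used only for illustration.

```python
from collections import defaultdict


def add_average_price_per_airline(rows):
    """Return copies of rows with an added 'avg_price_per_airline' field.

    Hypothetical helper: assumes each row is a dict with 'airline' and
    'price' keys. These names are illustrative, not the project's schema.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        totals[row["airline"]] += row["price"]
        counts[row["airline"]] += 1

    # Mean price for each airline seen in the data.
    averages = {name: totals[name] / counts[name] for name in totals}

    # Attach the airline-level mean to every row without mutating the input.
    return [
        {**row, "avg_price_per_airline": averages[row["airline"]]}
        for row in rows
    ]


# Made-up sample rows, for illustration only.
flights = [
    {"airline": "Airline A", "total_stops": 0, "duration_min": 95, "price": 300.0},
    {"airline": "Airline A", "total_stops": 1, "duration_min": 210, "price": 500.0},
    {"airline": "Airline B", "total_stops": 0, "duration_min": 100, "price": 250.0},
]

enriched = add_average_price_per_airline(flights)
for row in enriched:
    print(row["airline"], row["price"], row["avg_price_per_airline"])
```

Because this feature is built from the target column, one common precaution is to compute the averages on the training split only and then map them onto the test split, so that test prices do not leak into the features.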
Metric | Score |
---|---|
MAE | 61.87 |
MSE | 40409.87 |
RMSE | 201.02 |
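
For reference, the three metrics in the table can be computed as in the sketch below. It uses only the standard library and made-up prices, not the project's data. The function names and sample values are assumptions for illustration. The last line checks that the reported RMSE is the square root of the reported MSE.

```python
import math


def mean_absolute_error(y_true, y_pred):
    """Average of the absolute differences between actual and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def root_mean_squared_error(y_true, y_pred):
    """Square root of the MSE, expressed in the same units as the target."""
    return math.sqrt(mean_squared_error(y_true, y_pred))


# Made-up actual and predicted prices, for illustration only.
actual = [100.0, 200.0, 300.0]
predicted = [110.0, 190.0, 330.0]

mae = mean_absolute_error(actual, predicted)
mse = mean_squared_error(actual, predicted)
rmse = root_mean_squared_error(actual, predicted)
print(f"MAE={mae:.2f} MSE={mse:.2f} RMSE={rmse:.2f}")

# Consistency check on the reported scores: sqrt(40409.87) rounds to 201.02.
reported_rmse = round(math.sqrt(40409.87), 2)
print(reported_rmse)
```

RMSE squares each error before averaging, so a few large misses raise it much more than they raise MAE. An RMSE well above the MAE, as in the table, is consistent with a small number of flights being mispredicted by a wide margin, though confirming that would require inspecting the residuals.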