Trip-Advisor-Web-Scraping-and-Analysis-Auth-2023

This is a hands-on lab project on web scraping with Selenium. We extract raw text data from TripAdvisor, then preprocess and analyze it to uncover new insights. Specifically, we scrape TripAdvisor data for local cafes and bars in Thessaloniki.

The tutorial will have the following sections:

A. Installing and setting up Selenium

B. Presentation of the TripAdvisor dataset requirements

C. TripAdvisor dataset web scraping process

D. Data preprocessing for textual TripAdvisor data

E. Final DataFrame format

F. Data Analysis in Python

G. Insightful conclusions
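
As a rough illustration of what sections D to F involve, here is a minimal sketch of cleaning review text and counting word frequencies. It uses only the Python standard library and is not the tutorial's actual code: the sample reviews, the field names (`business`, `rating`, `text`), the stop-word list, and the helper functions `clean_review` and `tokenize` are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop-word list (an assumption); a real project would
# likely use a fuller list from an NLP library.
STOP_WORDS = {"the", "a", "an", "and", "is", "was", "in", "of", "to", "it", "for", "with"}


def clean_review(text):
    """Strip HTML remnants, lowercase, drop punctuation, collapse whitespace."""
    text = re.sub(r"<[^>]+>", " ", text)   # leftover tags such as <br>
    text = text.lower()
    text = re.sub(r"[^\w\s]", " ", text)   # punctuation becomes a space
    return re.sub(r"\s+", " ", text).strip()


def tokenize(text):
    """Split a cleaned review into tokens, skipping stop words."""
    return [tok for tok in clean_review(text).split() if tok not in STOP_WORDS]


# Made-up records standing in for scraped reviews (not real data).
reviews = [
    {"business": "Example Cafe", "rating": 5,
     "text": "Great coffee,<br> and a lovely view!"},
    {"business": "Example Bar", "rating": 3,
     "text": "The cocktails were OK; the coffee was great."},
]

# One row per review, with a token list added next to the raw fields.
rows = [{**review, "tokens": tokenize(review["text"])} for review in reviews]

# Word frequencies across all reviews.
word_counts = Counter(tok for row in rows for tok in row["tokens"])

if __name__ == "__main__":
    for row in rows:
        print(row["business"], row["rating"], row["tokens"])
    print(word_counts.most_common(3))
```

A list of dictionaries like `rows` can be passed directly to `pandas.DataFrame` if a tabular format is preferred for the analysis step.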



Aristotle University of Thessaloniki (AUTh)
Web Mining Course
Authors: Kyriaki Potamopoulou, Vasiliki Zarkadoula, Vasilis Daoulas