This repository has the set of files that gather information from the websites bild.de and wetter.de as a webscraping service, and from the weather channel by RESTful API calls.
The scripts that gather the data run on a server as cronjobs. The way they run is described by: crontab_info.txt
The structure for the RESTful API calls is the following:
-
api_info.py
has the necessary information to access the wunderground API. -
constants.py
has the global constants used across API scripts. -
city_location.py
is the script that gets the coordinates of specified named cities. -
daily_db.py
is the script that gathers daily data. -
hourly_db.py
is the script that gathers hourly data.
The structure for Wetter.de scraping is:
-
Wetter_de_scraping.py
scrapes hourly data. -
Web_Scraping_wetter_de_full_day.py
scrapes daily data. -
Web_Scraping_wetter_de_day_periods.py
scrapes periods of the day.
For bild.de:
bild_scraping.py
does both daily and daily period scraping.
The helper scripts for database insertion are:
database.py
db_manager.py
db_info.py