Skip to content

BCCN-Prog/webscraping_2018

Repository files navigation

webscraping_2018

This repository has the set of files that gather information from the websites bild.de and wetter.de as a webscraping service, and from the weather channel by RESTful API calls. The scripts that gather the data run on a server as cronjobs. The way they run is described by: crontab_info.txt

The structure for the RESTful API calls is the following:

  • api_info.py has the necessary information to access the wunderground API.

  • constants.py has the global constants used across API scripts.

  • city_location.py is the script that gets the coordinates of specified named cities.

  • daily_db.py is the script that gathers daily data.

  • hourly_db.py is the script that gathers hourly data.

The structure for Wetter.de scraping is:

  • Wetter_de_scraping.py scrapes hourly data.

  • Web_Scraping_wetter_de_full_day.py scrapes daily data.

  • Web_Scraping_wetter_de_day_periods.py scrapes periods of the day.

For bild.de:

  • bild_scraping.py does both daily and daily period scraping.

The helper scripts for database insertion are:

  • database.py
  • db_manager.py
  • db_info.py

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages