Skip to content
This repository has been archived by the owner on Oct 3, 2020. It is now read-only.

Latest commit

 

History

History
31 lines (23 loc) · 1.15 KB

README.md

File metadata and controls

31 lines (23 loc) · 1.15 KB

ICCAS dataset code

This repository contains the code used for generating and updating the ICCAS dataset.

Brief description of scripts

  • download_reports.py: function and script for parsing the ISS News page in order to retrieve links to all PDF reports and download the ones missing in the reports folder (reports are not included in the git repository because they take MBs).

  • make_datasets.py: function and script for generating a new "single-date" dataset for each report in the reports folder and updating the "full dataset"; When run as script, it first calls the function download_missing_reports() contained in download_reports.py.

  • update_dataset.py: script meant to be run in a cronjob for automatically creating and deploying new datasets when a new report is published; it notifies me (via emails) in case of errors or success.

Installation (for my future self)

Replace <user> with your username:

conda create -n iccas python=3
conda activate iccas
pip install -r requirements.txt
python C:\Users\<user>\miniconda3\envs\iccas\Scripts\pywin32_postinstall.py -install