This repository contains the code for a data pipeline that loads data related to the Euro 2024 tournament into a Postgres database.
The data is used to power a Metabase Dashboard on the EURO 2024 tournament.
The pipeline is built using the dlt library and uses data from the Sportmonks API.
To set up the project, you need to install the required dependencies. You can do this by running the following command:
pip install -r requirements.txt
Then, copy the .env.example
file to .env
and fill in the required environment variables.
cp .env.example .env
You can run the pipeline with the following command:
python sync.py [--full]
By default, the script will load the data for the current season. If you want to perform a full load of all the data, you can use the --full
option.
We scheduled the ETL process using Github Actions. You'll find the code for that in the .github/sync.yaml
file.