
Makes Kubernetes pod logs visible in the Airflow UI at runtime on GKE.


gadiener/bigquery-airflow-logger


BigQuery logger handler for Airflow

Installation

pip install airflow-bigquerylogger

Configuration

AIRFLOW__CORE__REMOTE_LOGGING='true'
AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER='gs://bucket/path'
AIRFLOW__CORE__REMOTE_LOG_CONN_ID='gcs_log'
AIRFLOW__CORE__LOGGING_CONFIG_CLASS='bigquerylogger.config.LOGGING_CLASS'
AIRFLOW__CORE__LOG_BIGQUERY_DATASET='dataset.table'
AIRFLOW__CORE__LOG_BIGQUERY_LIMIT=50
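
The variables above follow Airflow's AIRFLOW__{SECTION}__{KEY} naming convention for overriding configuration options through the environment. As a rough sketch of how those names map to config sections and keys, the hypothetical helper below (airflow_env_to_option is not part of this package or of Airflow) splits a variable name into its parts; the values are simply the examples from this README:

```python
# Hypothetical helper for illustration only; not part of bigquerylogger or Airflow.
def airflow_env_to_option(name):
    """Split an AIRFLOW__SECTION__KEY variable name into (section, key)."""
    prefix = "AIRFLOW__"
    if not name.startswith(prefix):
        raise ValueError(f"not an Airflow config variable: {name!r}")
    # The section and the key are separated by a double underscore.
    section, sep, key = name[len(prefix):].partition("__")
    if not sep or not section or not key:
        raise ValueError(f"expected AIRFLOW__SECTION__KEY, got {name!r}")
    return section.lower(), key.lower()


# Example values copied from the configuration above.
settings = {
    "AIRFLOW__CORE__REMOTE_LOGGING": "true",
    "AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER": "gs://bucket/path",
    "AIRFLOW__CORE__REMOTE_LOG_CONN_ID": "gcs_log",
    "AIRFLOW__CORE__LOGGING_CONFIG_CLASS": "bigquerylogger.config.LOGGING_CLASS",
    "AIRFLOW__CORE__LOG_BIGQUERY_DATASET": "dataset.table",
    "AIRFLOW__CORE__LOG_BIGQUERY_LIMIT": "50",
}

for env_name, value in settings.items():
    section, key = airflow_env_to_option(env_name)
    print(f"[{section}] {key} = {value}")
```

All six options land in the core section, so the same settings could presumably be written under [core] in airflow.cfg instead of being exported as environment variables.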

Google Cloud BigQuery

Rows recently written to a table via streaming (using the tabledata.insertAll method) cannot be modified with UPDATE, DELETE, or MERGE statements. I recommend setting up a retention (expiration) policy on the table so that old log rows are removed automatically.
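
One possible way to set up retention, assuming the log table is time-partitioned (an assumption; this README does not describe the table layout), is BigQuery's partition_expiration_days table option. The sketch below only builds the DDL statement; the helper name partition_expiration_ddl and the 30-day value are illustrative, and running the statement is left to the BigQuery console, the bq tool, or a client library:

```python
import re

# Hypothetical helper for illustration; it only builds a DDL string and
# does not talk to BigQuery.
def partition_expiration_ddl(table, days):
    """Return an ALTER TABLE statement that expires partitions after `days` days.

    `table` is a dataset.table or project.dataset.table identifier.
    Assumes the target table is time-partitioned.
    """
    if not re.fullmatch(r"[\w-]+(\.\w+){1,2}", table):
        raise ValueError(f"unexpected table identifier: {table!r}")
    if not isinstance(days, int) or days <= 0:
        raise ValueError("days must be a positive integer")
    return (
        f"ALTER TABLE `{table}` "
        f"SET OPTIONS (partition_expiration_days = {days})"
    )


# 'dataset.table' matches the AIRFLOW__CORE__LOG_BIGQUERY_DATASET example value.
print(partition_expiration_ddl("dataset.table", 30))
```

Expiration is applied by BigQuery itself, so it does not rely on the DML statements that are unavailable for recently streamed rows.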

Credits

Thanks to the Bluecore engineering team for this useful article.
