chore(pipeline) : use Sentry as main notifier #369

vperron · 2025-01-15T15:59:19Z

Sentry is a much better tool than our little code executor, at least in terms of not spamming our alerting channel. We can configure precisely how and when we want to be notified and what is important, discuss issues, etc.

It also now directly links every issue to a particular commit, making it easier to identify regressions.

The flow changes a little bit as the logs are in the Sentry issue, not directly linked in the Slack message, but I hope we'll be able to triage much better.

The goal here is to take advantage of the migration to Slack and kill two birds with one stone.

Example of the custom info added to Sentry :

vmttn

Almost tempted to simply use https://airflow.apache.org/docs/apache-airflow/stable/administration-and-deployment/logging-monitoring/errors.html

vmttn · 2025-01-16T07:14:50Z

pipeline/dags/dag_utils/sentry.py

+    with configure_scope() as scope:
+        dag_id = context.get("dag").dag_id
+        task_id = context.get("task_instance").task_id
+        execution_date = context.get("execution_date")


execution_date has been deprecated for a long time

oops :/ indeed
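
For reference, a minimal sketch of how the tags could be derived without `execution_date`, assuming Airflow 2.2+ where the task context exposes `logical_date`. The helper name `build_sentry_tags` and the tag names are hypothetical and not what this PR ships; the returned dict would then be applied to the Sentry scope, for instance with `scope.set_tag`.

```python
from datetime import datetime, timezone
from types import SimpleNamespace


def build_sentry_tags(context):
    """Return the tags to attach to a Sentry scope for a failed task.

    Hypothetical helper: reads `logical_date` (the replacement for the
    deprecated `execution_date`) from the Airflow task context.
    """
    tags = {
        "dag_id": context["dag"].dag_id,
        "task_id": context["task_instance"].task_id,
    }
    logical_date = context.get("logical_date")
    if logical_date is not None:
        tags["logical_date"] = logical_date.isoformat()
    return tags


# Stand-in for the context Airflow passes to callbacks (illustration only).
fake_context = {
    "dag": SimpleNamespace(dag_id="example_dag"),
    "task_instance": SimpleNamespace(task_id="extract"),
    "logical_date": datetime(2025, 1, 15, tzinfo=timezone.utc),
}
tags = build_sentry_tags(fake_context)
```

Keeping the tag computation in a plain function, separate from the Sentry call, would also make it easy to unit test without a DSN.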

vmttn · 2025-01-16T07:22:27Z

pipeline/dags/dag_utils/sentry.py

+from sentry_sdk import configure_scope
+
+
+def task_failure_callback(context):


This is fairly hacky, since the callback does nothing other than configure a scope.

Yes, the naming is bad. I pushed yesterday before leaving but should have opened a draft, my apologies 🙏

vmttn · 2025-01-16T07:23:16Z

deployment/main.tf

@@ -229,6 +229,7 @@ resource "null_resource" "up" {
    AIRFLOW_WWW_USER_PASSWORD='${var.airflow_admin_password}'
    AIRFLOW__CORE__FERNET_KEY='${var.airflow__core__fernet_key}'
    AIRFLOW__SENTRY__SENTRY_DSN='${var.airflow__sentry__sentry_dsn}'
+    AIRFLOW__SENTRY__RELEASE='${var.stack_version}'


Not a fan of the naming.

That's because the `release` option in Sentry's configuration lets you tie events to a version of the software in question:
https://docs.sentry.io/product/sentry-basics/integrate-backend/configuration-options/

It may be a bit overkill: we could just set a tag with the SHA and a URL to the commit. But I figured that adopting the notion of "release" right away is a plus (issues are automatically linked to the suspect commit, etc.).
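
To illustrate the lighter alternative mentioned here (a tag with the SHA plus a URL to the commit), a minimal sketch follows. The function name, the tag names, and the repository URL are placeholders chosen for the example, not values from this repository.

```python
def commit_tags(sha, repo_url):
    """Build a short-SHA tag and a GitHub-style commit URL from a full SHA."""
    return {
        "commit_sha": sha[:7],
        "commit_url": f"{repo_url.rstrip('/')}/commit/{sha}",
    }


# Placeholder values for illustration only.
example = commit_tags(
    "0123456789abcdef0123456789abcdef01234567",
    "https://github.com/example-org/example-repo/",
)
```

Unlike a Sentry release, tags like these would only label each event; they would not give Sentry what it needs to flag a suspect commit on its own.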

vperron · 2025-01-16T08:19:27Z

If we simply use the basic logging (as we do today), we lose at the very least the direct link to the Airflow logs from the issue, which we have had until now in Mattermost.

Where you're right, though, is that we can probably use before_send to enrich the context, which avoids annotating every DAG with the arguments in question. But we won't have access to the DAG's properties as easily; to be tested.

I think configuring a release is a clear plus, because we can immediately tell which commit made a given problem appear or reappear.
Having a direct link to the commit in GH is also quite handy and done in 2 or 3 lines of code, so I think it's useful...

Lastly, tagging the DAG and task names is a minor thing, but it lets you identify very quickly, in the right-hand sidebar, the conditions of a given error, though I admit it may be slightly less useful. It's just that the errors are quite rich in content (stack trace, logs, breadcrumbs, ...) and I don't like parsing all of that by eye to quickly understand what failed, but that's a personal preference.
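
A rough sketch of what the before_send route could look like. It assumes the task process exposes the `AIRFLOW_CTX_DAG_ID` and `AIRFLOW_CTX_TASK_ID` environment variables, which would need to be verified for our executor, and that the event carries its tags as a dict. The extra `environ` parameter is only there to make the sketch testable; Sentry itself calls the hook with `(event, hint)`.

```python
import os


def before_send(event, hint, environ=None):
    """Sentry `before_send`-style hook that tags events with Airflow task info.

    Assumption: the task process exposes AIRFLOW_CTX_* environment variables.
    """
    environ = os.environ if environ is None else environ
    tags = event.setdefault("tags", {})
    for tag, var in (
        ("dag_id", "AIRFLOW_CTX_DAG_ID"),
        ("task_id", "AIRFLOW_CTX_TASK_ID"),
    ):
        value = environ.get(var)
        if value:
            tags[tag] = value
    # Returning the event keeps it; returning None would drop it.
    return event


# Illustration with a fake event and a fake environment.
tagged = before_send(
    {"message": "boom"},
    {},
    environ={"AIRFLOW_CTX_DAG_ID": "example_dag", "AIRFLOW_CTX_TASK_ID": "extract"},
)
```

This only reaches what is available in the environment, which matches the caveat above: DAG properties beyond the identifiers would still be harder to get than from the callback context.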

vperron requested a review from vmttn as a code owner January 15, 2025 15:59

vperron force-pushed the vperron/notify-sentry branch from 9738fc3 to 728b6ae Compare January 15, 2025 16:38

vmttn requested changes Jan 16, 2025

vperron force-pushed the vperron/notify-sentry branch from 728b6ae to ab05c24 Compare January 16, 2025 08:30

vperron requested a review from vmttn January 16, 2025 08:31
