feat(eap): Start decoupling EAP entities at the entity layer #6701

colin-sentry · 2024-12-27T19:25:04Z

There's currently a big chunk of code in common.py that maps column accesses to the correct place.

However, it's all hard-coded to the spans table right now.

We would like to add new non-span entity types to the EAP RPCs, and the easiest place to do that is at the entity layer.

This essentially hides the 'real columns' behind a well-known set of columns (organization_id, attr_str, attr_f64, attr_i64, ...) which will be shared across all EAP entities.

When we add new entities, we can specify in the entity YAML what maps where in a way that the RPC doesn't have to know what type of entity is being processed.

Works towards solving https://github.com/getsentry/eap-planning/issues/126

phacops

We also shoul

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml

phacops · 2025-01-03T21:51:59Z

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml

+  [
+    { name: service, type: String },
+    { name: trace_id, type: UUID },
+    { name: span_id, type: UInt, args: { size: 64 } },


We should have some sort of a event_id, as a string, required in for the RPC. Having it as a string would let us pass any sort of UUIDs or span IDs.

How about uint128? String would have a big performance hit

Let's add that later, would require another net-new processor

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml

onkar

Thanks for working on this. I left a few comments.

snuba/clickhouse/translators/snuba/mappers.py

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml

tests/clickhouse/translators/snuba/test_translation.py

colin-sentry · 2025-01-07T22:04:44Z

snuba/clickhouse/translators/snuba/mappers.py

@@ -229,41 +227,6 @@ def attempt_map(
            return None


-@dataclass(frozen=True)
-class SubscriptableHashBucketMapper(SubscriptableReferenceMapper):


This gets moved into a query processor (the same one which handles mapKeys and mapContains)

colin-sentry · 2025-01-07T22:05:47Z

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans.yaml

@@ -88,11 +74,16 @@ query_processors:
      curried_aggregation_names:
        - quantile
        - quantileTDigestWeighted
-  - processor: HashBucketFunctionTransformer


There was a really annoying order of operations, where query processors needed to know what bucket things would end up in, but that was done at the storage level.

By merging all of the processors which need to know the actual bucket-level information to a single one at the end, the pipeline is a lot more understandable and has less chance for bugs

colin-sentry · 2025-01-07T22:08:26Z

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml

+storage_selector:
+  selector: DefaultQueryStorageSelector
+
+query_processors:


I rewrote a lot of this, there were starting to be conflicting edge cases where, e.g., sum(attr_f64[sentry.duration_ms]) should become sum(duration_ms), but the very similar

sum(attr_i64[blah]) should become

sumIf(CAST(attr_num_2[blah], 'Integer'), mapContains(attr_num_2, 'blah'))

colin-sentry · 2025-01-08T18:31:02Z

Someone one Pierre's team will be finishing this PR

colin-sentry requested review from a team as code owners December 27, 2024 19:25

colin-sentry requested a review from phacops December 27, 2024 19:46

colin-sentry mentioned this pull request Dec 27, 2024

(wip) Remove hardcoded references to eap_spans in EAP RPCs #6702

Closed

colin-sentry force-pushed the eap_entity_2 branch 2 times, most recently from 693b230 to cc41bc3 Compare December 30, 2024 21:15

phacops reviewed Jan 3, 2025

View reviewed changes

phacops requested a review from a team January 3, 2025 21:53

phacops reviewed Jan 3, 2025

View reviewed changes

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml Show resolved Hide resolved

onkar reviewed Jan 4, 2025

View reviewed changes

snuba/clickhouse/translators/snuba/mappers.py Outdated Show resolved Hide resolved

snuba/datasets/configuration/events_analytics_platform/entities/eap_spans_rpc.yaml Outdated Show resolved Hide resolved

tests/clickhouse/translators/snuba/test_translation.py Outdated Show resolved Hide resolved

colin-sentry added 8 commits January 7, 2025 14:09

feat(eap): Start decoupling EAP entities at the entity layer

a715aa7

add test

6ee97f3

fix mypy

706ac91

fix test

21fb74f

feat: implement mapping for HashBucketFunctionTransformer

626d79f

fix: some review feedback

b632070

fix: make data type required

8626469

fix: mypy

45e6547

colin-sentry force-pushed the eap_entity_2 branch from 2640a01 to 45e6547 Compare January 7, 2025 19:09

colin-sentry requested review from phacops and onkar January 7, 2025 19:09

colin-sentry added 2 commits January 7, 2025 15:48

tmp

005de34

big refactor: simplify a lot of the remapping logic

5eb4974

colin-sentry commented Jan 7, 2025

View reviewed changes

colin-sentry added 2 commits January 7, 2025 17:13

mypy

6461c67

test: add a test for EAPClickhouseColumnRemapper

4310368

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(eap): Start decoupling EAP entities at the entity layer #6701

feat(eap): Start decoupling EAP entities at the entity layer #6701

colin-sentry commented Dec 27, 2024 •

edited

Loading

phacops left a comment

phacops Jan 3, 2025

colin-sentry Jan 6, 2025

colin-sentry Jan 7, 2025

onkar left a comment

colin-sentry Jan 7, 2025

colin-sentry Jan 7, 2025

colin-sentry Jan 7, 2025 •

edited

Loading

colin-sentry commented Jan 8, 2025 •

edited

Loading

feat(eap): Start decoupling EAP entities at the entity layer #6701

Are you sure you want to change the base?

feat(eap): Start decoupling EAP entities at the entity layer #6701

Conversation

colin-sentry commented Dec 27, 2024 • edited Loading

phacops left a comment

Choose a reason for hiding this comment

phacops Jan 3, 2025

Choose a reason for hiding this comment

colin-sentry Jan 6, 2025

Choose a reason for hiding this comment

colin-sentry Jan 7, 2025

Choose a reason for hiding this comment

onkar left a comment

Choose a reason for hiding this comment

colin-sentry Jan 7, 2025

Choose a reason for hiding this comment

colin-sentry Jan 7, 2025

Choose a reason for hiding this comment

colin-sentry Jan 7, 2025 • edited Loading

Choose a reason for hiding this comment

colin-sentry commented Jan 8, 2025 • edited Loading

colin-sentry commented Dec 27, 2024 •

edited

Loading

colin-sentry Jan 7, 2025 •

edited

Loading

colin-sentry commented Jan 8, 2025 •

edited

Loading