
Simplify delivery loop logic #1743

Merged 1 commit into main on Dec 9, 2024

Conversation

@fractalwrench (Contributor) commented Dec 4, 2024

Goal

Simplifies the delivery loop logic to avoid checking state, as this was causing flakes in our integration tests.

It's worth noting two consequences of this change: (1) deliveryLoop can now be called multiple times, and (2) queueDelivery is much more likely to fail once old payloads are deleted from disk. FWIW, both of these conditions could also occur with the old approach via scheduleDeliveryLoopForNextRetry.

One alternative, given that deliveryWorker is only ever used in this class, would be to have deliveryWorker poll a BlockingQueue indefinitely, with schedulingWorker adding to that queue (a rough sketch follows below). However, this adds complications around enforcing uniqueness of the elements in the queue and is a larger change.
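
A rough sketch of what that alternative could look like; the PendingPayload type, sendPayload hook, and class name are placeholders for illustration, not the SDK's actual API:

import java.util.concurrent.ExecutorService
import java.util.concurrent.Executors
import java.util.concurrent.LinkedBlockingQueue

// Placeholder for the stored payload metadata; the real SDK uses StoredTelemetryMetadata.
data class PendingPayload(val id: String)

class BlockingQueueDeliverySketch(
    private val sendPayload: (PendingPayload) -> Unit,
) {
    private val deliveryWorker: ExecutorService = Executors.newSingleThreadExecutor()
    private val schedulingWorker: ExecutorService = Executors.newSingleThreadExecutor()

    // The queue decouples scheduling from delivery, but a plain LinkedBlockingQueue
    // accepts duplicates, so an auxiliary set is needed to enforce uniqueness - the
    // complication mentioned above.
    private val queue = LinkedBlockingQueue<PendingPayload>()
    private val pending = mutableSetOf<PendingPayload>()

    init {
        // deliveryWorker polls the queue indefinitely, blocking until work arrives.
        deliveryWorker.submit {
            while (!Thread.currentThread().isInterrupted) {
                val payload = queue.take()
                synchronized(pending) { pending.remove(payload) }
                sendPayload(payload)
            }
        }
    }

    // schedulingWorker enqueues payloads for delivery; duplicates are dropped.
    fun schedule(payload: PendingPayload) {
        schedulingWorker.submit {
            val added = synchronized(pending) { pending.add(payload) }
            if (added) {
                queue.put(payload)
            }
        }
    }
}

The uniqueness bookkeeping above is exactly the extra complexity that makes this a larger change than the approach taken in this PR.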

Alternatively, given this is just an issue in testing and not really with deliverability, we could just update our test harness to flush any payloads that are left hanging.

Testing

I ran the integration tests ~15 times and they passed consistently with these changes, whereas without these changes one test case fails on roughly every other run.

github-actions bot commented Dec 4, 2024

Dependency Review

✅ No vulnerabilities, license issues, or OpenSSF Scorecard issues found.


codecov bot commented Dec 4, 2024

Codecov Report

Attention: Patch coverage is 90.90909% with 1 line in your changes missing coverage. Please review.

Project coverage is 85.34%. Comparing base (9b5b93e) to head (6c214b9).
Report is 2 commits behind head on main.

Files with missing lines | Patch % | Lines
...esdk/internal/delivery/debug/DeliveryTraceState.kt | 50.00% | 1 Missing ⚠️
Additional details and impacted files


@@            Coverage Diff             @@
##             main    #1743      +/-   ##
==========================================
+ Coverage   85.31%   85.34%   +0.03%     
==========================================
  Files         464      464              
  Lines       10783    10775       -8     
  Branches     1596     1591       -5     
==========================================
- Hits         9199     9196       -3     
  Misses        870      870              
+ Partials      714      709       -5     
Files with missing lines | Coverage Δ
...bracesdk/internal/delivery/debug/DeliveryTracer.kt | 82.60% <100.00%> (ø)
...ernal/delivery/scheduling/SchedulingServiceImpl.kt | 87.82% <100.00%> (+0.02%) ⬆️
...esdk/internal/delivery/debug/DeliveryTraceState.kt | 55.10% <50.00%> (ø)

... and 3 files with indirect coverage changes

@bidetofevil (Collaborator) left a comment


Looking at the code, I think simplifying is the right choice.

But looking at this logic, I think we can simplify even further - instead of calling createPayloadQueue to return all the outstanding payloads, we just need the next one - the queue will be refreshed at every iteration anyway.

Basically, this means we are managing the loop via jobs submitted to schedulingWorker, which ensures that at every iteration we look at the latest state of the in-memory cache, which seems fine? If there's an excess of jobs that run and find nothing to deliver, it's not so bad. There shouldn't even be that much backpressure built up as a result, given how fast the job should run.

-    deliveryTracer?.onStartDeliveryLoop(false)
+    deliveryTracer?.onStartDeliveryLoop()
     schedulingWorker.submit {
         deliveryLoop()
     }
 }

Collaborator

[Re: line +78]

Change this method to return only the next payload to be delivered. It'll basically poll at every iteration and bail when there are none to be delivered.

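To illustrate the loop shape this suggests, here is a minimal, self-contained sketch. The names createPayloadQueue, schedulingWorker, and deliveryLoop mirror this thread, but the payload type and send logic are placeholders rather than the SDK's implementation:

import java.util.concurrent.Executors

object DeliveryLoopSketch {
    private val schedulingWorker = Executors.newSingleThreadExecutor { r ->
        Thread(r).apply { isDaemon = true }
    }
    private val outstanding = ArrayDeque(listOf("session-1", "log-batch-2"))

    // Stand-in for createPayloadQueue(): returns the next eligible payload, or null.
    private fun createPayloadQueue(): String? =
        synchronized(outstanding) { outstanding.removeFirstOrNull() }

    fun deliveryLoop() {
        val payload = createPayloadQueue() ?: return // nothing eligible: bail out
        println("delivering $payload")               // placeholder for the real send
        schedulingWorker.submit { deliveryLoop() }   // re-poll for the next payload
    }
}

fun main() {
    DeliveryLoopSketch.deliveryLoop()
    Thread.sleep(500) // give the daemon worker time to drain the sketch queue
}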

Contributor (Author)

I attempted this but it became problematic: only taking the first payload means it would be necessary to track which payloads had and hadn't been attempted. Without adding that state I simply ran into GC problems, as the proposed change would loop indefinitely.

Collaborator

So the in-progress payload should be filtered out by the filter on activeSends inside StoredTelemetryMetadata.shouldSendPayload(), which is called inside createPayloadQueue. Doesn't that happen if you simply change createPayloadQueue to return only the first element rather than the whole LinkedList?

Collaborator

e.g. change that function to:

private fun createPayloadQueue(exclude: Set<StoredTelemetryMetadata> = emptySet()): StoredTelemetryMetadata? {
    val payloadsByPriority = storageService.getPayloadsByPriority()
    // Pick the single highest-priority payload that is still eligible to send,
    // or null so the caller can bail when nothing remains.
    val payloadToSend = payloadsByPriority
        .filter { it.shouldSendPayload() && !exclude.contains(it) }
        .sortedWith(storedTelemetryComparator)
        .firstOrNull()
    deliveryTracer?.onPayloadQueueCreated(
        payloadsByPriority,
        payloadToSend,
    )
    return payloadToSend
}

Contributor (Author)

It doesn't work for the case where a payload isn't enqueued (e.g. if an endpoint is rate-limited), and it hangs the application. We could track those payloads too, but I'm wary that might add more complexity on top. We can chat synchronously about options later today.

Contributor (Author)

Disregard my comment above - I think I just made a mistake in the original implementation of these changes, and things now work fine in my latest iteration. There are only two states I can see (failed vs active), so this approach should work fine.

@bidetofevil (Collaborator) left a comment


LGTM. One comment about returning just the next payload rather than the entire queue, since we'll never need more than that now.

@fractalwrench fractalwrench force-pushed the simplify-delivery-loop branch from 24fd780 to 220a57a Compare December 6, 2024 16:28
@fractalwrench fractalwrench marked this pull request as ready for review December 6, 2024 16:30
@fractalwrench fractalwrench requested a review from a team as a code owner December 6, 2024 16:30
@bidetofevil (Collaborator) left a comment


Awesome. We'll watch out for new flakes after this is merged.

@fractalwrench fractalwrench force-pushed the simplify-delivery-loop branch from 220a57a to 6c214b9 Compare December 9, 2024 10:03
@fractalwrench fractalwrench merged commit f3c3d7f into main Dec 9, 2024
7 checks passed
@fractalwrench fractalwrench deleted the simplify-delivery-loop branch December 9, 2024 10:18