[parallel] If not num_threads, use queue instead of tail recursion #306

mpsijm · 2023-09-13T10:11:44Z

Previously, running `bt fuzz --jobs 0` would crash with a stack overflow after 244 iterations (roughly 4 stack frames per iteration). This happened because every GeneratorTask started the next task from inside its own call: `Parallel.put` called its lambda, which ran the new GeneratorTask, which called `Parallel.put` again in `finish_task`, and so on, so the call stack grew with every iteration.
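To illustrate the failure mode, here is a minimal sketch. It is not the actual BAPCtools code: `RecursiveRunner` and `iterations_before_overflow` are hypothetical names, and the sketch adds 2 frames per iteration rather than the roughly 4 of the real call chain. In plain Python this kind of chain ends in a `RecursionError`; the default recursion limit of 1000 fits with about 244 iterations at roughly 4 frames each, on top of the frames already on the stack.

```python
import sys


class RecursiveRunner:
    """Hypothetical stand-in for the old sequential path: put() runs the task at once."""

    def __init__(self, f):
        self.f = f

    def put(self, task):
        # No queue: the task executes right here, on top of the caller's stack.
        self.f(task)


def iterations_before_overflow():
    """Count how many self-rescheduling tasks start before Python gives up."""
    started = 0

    def task(i):
        nonlocal started
        started += 1
        # Like finish_task: schedule the next iteration from inside this one.
        runner.put(i + 1)

    runner = RecursiveRunner(task)
    try:
        runner.put(0)
    except RecursionError:
        pass  # the stack grew by two frames per iteration until the limit was hit
    return started
```

Calling `iterations_before_overflow()` returns a count well below `sys.getrecursionlimit()`, because every iteration keeps its frames alive until the very end.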

I've rewritten the task handling for the `not num_threads` case by moving it all to `Parallel.done`. `Parallel.put` now puts the task in the same priority queue that is used when we _do_ want parallelism (`num_threads > 0`). The fuzzer should now be able to run more than 244 iterations again 😄
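The idea can be sketched as follows. This is a simplified illustration under assumptions, not the code in `bin/parallel.py`: the class name `SequentialRunner`, the `priority` parameter, and the "higher priority first" ordering are all assumed for the example. `put` only enqueues, and `done` drains the queue in a loop, so a task that enqueues its successor no longer deepens the stack.

```python
import heapq


class SequentialRunner:
    """Hypothetical sequential runner: put() enqueues, done() drains in a loop."""

    def __init__(self, f):
        self.f = f
        self.tasks = []  # min-heap of (-priority, insertion index, task)
        self.total = 0

    def put(self, task, priority=0):
        # Assumed ordering: higher priority first, then insertion order.
        heapq.heappush(self.tasks, (-priority, self.total, task))
        self.total += 1

    def done(self):
        # Tasks may call put() while running; the loop picks those up as well.
        while self.tasks:
            _, _, task = heapq.heappop(self.tasks)
            self.f(task)


def run_iterations(n):
    """Run n self-rescheduling tasks and return how many completed."""
    completed = 0

    def task(i):
        nonlocal completed
        completed += 1
        if i + 1 < n:
            runner.put(i + 1)  # returns immediately; nothing runs recursively

    runner = SequentialRunner(task)
    runner.put(0)
    runner.done()
    return completed
```

With this structure, `run_iterations(5000)` completes all iterations at a constant stack depth.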

I created a PR instead of simply pushing to master, since I haven't touched all of this parallel stuff that often, so some extra eyes would be nice 🙂

To quickly reproduce the bug with `bt fuzz -j0`, I added a minimal generator to the test problem "hellowholeworld", which makes 244 iterations take about 2 minutes:

diff --git a/test/problems/hellowholeworld/generators/gen.py b/test/problems/hellowholeworld/generators/gen.py
new file mode 100644
index 0000000..829afc9
--- /dev/null
+++ b/test/problems/hellowholeworld/generators/gen.py
@@ -0,0 +1,6 @@
+#!/usr/bin/env python3
+import random
+import sys
+
+random.seed(sys.argv[1])
+print(random.randint(1, 100))
diff --git a/test/problems/hellowholeworld/generators/generators.yaml b/test/problems/hellowholeworld/generators/generators.yaml
new file mode 100644
index 0000000..2f90cce
--- /dev/null
+++ b/test/problems/hellowholeworld/generators/generators.yaml
@@ -0,0 +1,9 @@
+solution: /submissions/accepted/test-hello.py
+data:
+  sample:
+    data:
+      "1":
+        in: "1"
+  secret:
+    data:
+      - "": gen.py {seed}

bin/parallel.py

bin/fuzz.py

bin/parallel.py

mpsijm · 2024-02-03T11:42:23Z

Better late than never 😄

While trying to resolve your feedback, I came to the conclusion that having one class that's doing very different things in the sequential vs. the parallel case is not very maintainable. So, I decided to split the two functionalities into two subclasses of a common abstract superclass.

The consequence of this is that @mzuenni's fix for the fuzzer in #327 is no longer necessary: the user of `parallel.create` can always transparently treat the result as if it were a parallel queue, even though it's secretly a sequential queue when `args.jobs == 0` 😄
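A rough sketch of that design, under stated assumptions: `SequentialQueue`, `ParallelQueue`, and `create` are names from this PR, but `AbstractQueue`, the `create(f, num_threads)` signature, and all method bodies below are made up for illustration. Unlike a real implementation, this sketch only starts its worker threads in `done`.

```python
import heapq
import threading
from abc import ABC, abstractmethod


class AbstractQueue(ABC):  # hypothetical name for the common superclass
    def __init__(self, f):
        self.f = f
        self.tasks = []  # min-heap of (-priority, insertion index, task)
        self.total = 0

    def _push(self, task, priority):
        heapq.heappush(self.tasks, (-priority, self.total, task))
        self.total += 1

    @abstractmethod
    def put(self, task, priority=0): ...

    @abstractmethod
    def done(self): ...


class SequentialQueue(AbstractQueue):
    def put(self, task, priority=0):
        self._push(task, priority)

    def done(self):
        while self.tasks:  # drain iteratively, no recursion
            self.f(heapq.heappop(self.tasks)[2])


class ParallelQueue(AbstractQueue):
    def __init__(self, f, num_threads):
        super().__init__(f)
        self.num_threads = num_threads
        self.cond = threading.Condition()
        self.running = 0  # number of tasks currently executing

    def put(self, task, priority=0):
        with self.cond:
            self._push(task, priority)
            self.cond.notify()

    def _worker(self):
        while True:
            with self.cond:
                # A running task may still enqueue more work, so wait for it.
                while not self.tasks and self.running > 0:
                    self.cond.wait()
                if not self.tasks:
                    return  # nothing queued and nothing running: finished
                task = heapq.heappop(self.tasks)[2]
                self.running += 1
            try:
                self.f(task)
            finally:
                with self.cond:
                    self.running -= 1
                    self.cond.notify_all()

    def done(self):
        threads = [threading.Thread(target=self._worker) for _ in range(self.num_threads)]
        for t in threads:
            t.start()
        for t in threads:
            t.join()


def create(f, num_threads):
    # Assumed signature: callers never need to know which subclass they get.
    return ParallelQueue(f, num_threads) if num_threads else SequentialQueue(f)


def run_chain(num_threads, n):
    """Caller code that is identical for the sequential and the parallel case."""
    lock = threading.Lock()
    completed = 0

    def task(i):
        nonlocal completed
        with lock:
            completed += 1
        if i + 1 < n:
            queue.put(i + 1)

    queue = create(task, num_threads)
    queue.put(0)
    queue.done()
    return completed
```

The point of `run_chain` is that it has no branch on `num_threads`: the same `put`/`done` calls work for `run_chain(0, n)` and `run_chain(4, n)`.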

RagnarGrootKoerkamp

Mostly LGTM; just some small comments.

Still not entirely convinced we actually need this but happy to merge either way.

bin/fuzz.py

bin/parallel.py

mpsijm requested review from RagnarGrootKoerkamp and mzuenni September 13, 2023 10:11

mzuenni reviewed Sep 13, 2023

RagnarGrootKoerkamp reviewed Sep 13, 2023

mpsijm mentioned this pull request Dec 31, 2023

[parallel] If not num_threads dont use tail recursion #327

Merged

mpsijm force-pushed the fix-parallel-without-threads branch from 91e67e7 to 50c2d1d Compare February 3, 2024 11:42

RagnarGrootKoerkamp approved these changes Feb 4, 2024

mpsijm force-pushed the fix-parallel-without-threads branch from 50c2d1d to 896b747 Compare February 10, 2024 16:34

mpsijm added 5 commits February 16, 2024 08:15

[fuzz] Remove condition that referred to non-existing variable `target_ansfile`

abba66f

Revert "fix fuzz for -j 0"

9416c09

This reverts commit 97ab1d7.

[parallel] Explicitly split up SequentialQueue and ParallelQueue

87d2e5d

[parallel] Improve documentation

f348174

mpsijm force-pushed the fix-parallel-without-threads branch from 896b747 to f348174 Compare February 16, 2024 07:15

RagnarGrootKoerkamp approved these changes Feb 16, 2024

mpsijm merged commit d502dcd into RagnarGrootKoerkamp:master Feb 16, 2024
1 check passed

mpsijm deleted the fix-parallel-without-threads branch February 16, 2024 09:00
