[data] Fix map operator fusion when concurrency is set #49573

raulchen · 2025-01-04T00:51:38Z

Why are these changes needed?

Current operator fusion rule doesn't consider concurrency.
This PR fixes this issue by only allow fusing 2 operators when they have the same concurrency.
For task->actor, we allow fusion when task's concurrency = actor's upper bound.
Also fixed some type hinting issues regarding compute strategy.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Hao Chen <chenh1024@gmail.com>

Jay-ju · 2025-01-06T00:38:57Z

Can the fuse operator be used as a switch to ignore the choice of GPU/CPU and concurrent settings? As long as the switch is turned on, fusion is performed. This is very useful for the situation of out-of-memory (OOM).

gvspraveen · 2025-01-06T03:26:59Z

python/ray/data/_internal/logical/rules/operator_fusion.py

+        Task->Task and Task->Actor are allowed.
+        Actor->Actor and Actor->Task are not allowed.
+        """
+        if isinstance(up_compute, ActorPoolStrategy):


Any reason you are disallowing upstream op being Actor here?

Previous logic seems to permit Actor -> Task as long as compute are compatible/same.

is_task_compute(down_logical_op._compute) and get_compute( up_logical_op._compute ) != get_compute(down_logical_op._compute) return False

Actor -> Task is already disallowed by this condition above.

for future. if resource requirements are same, should we allow Actor -> Task fuse?

raulchen · 2025-01-06T18:52:09Z

Can the fuse operator be used as a switch to ignore the choice of GPU/CPU and concurrent settings? As long as the switch is turned on, fusion is performed. This is very useful for the situation of out-of-memory (OOM).

Can you just set the same args to allow fusion?

Signed-off-by: Hao Chen <chenh1024@gmail.com>

Jay-ju · 2025-01-08T01:22:08Z

Can the fuse operator be used as a switch to ignore the choice of GPU/CPU and concurrent settings? As long as the switch is turned on, fusion is performed. This is very useful for the situation of out-of-memory (OOM).

Can you just set the same args to allow fusion?

Is it also possible if one is a CPU and the other is a GPU?

…9573) Current operator fusion rule doesn't consider concurrency. This PR fixes this issue by only allow fusing 2 operators when they have the same concurrency. For task->actor, we allow fusion when task's concurrency = actor's upper bound. Also fixed some type hinting issues regarding compute strategy. --------- Signed-off-by: Hao Chen <chenh1024@gmail.com> Signed-off-by: Roshan Kathawate <roshankathawate@gmail.com>

raulchen added 2 commits January 3, 2025 15:36

fix fusing compute strategies

ae7a72e

Signed-off-by: Hao Chen <chenh1024@gmail.com>

add test

70baaee

Signed-off-by: Hao Chen <chenh1024@gmail.com>

raulchen requested a review from a team as a code owner January 4, 2025 00:51

gvspraveen reviewed Jan 6, 2025

View reviewed changes

gvspraveen approved these changes Jan 6, 2025

View reviewed changes

bveeramani approved these changes Jan 6, 2025

View reviewed changes

raulchen added 2 commits January 6, 2025 13:51

lint

7b69320

Signed-off-by: Hao Chen <chenh1024@gmail.com>

fix

42259a4

Signed-off-by: Hao Chen <chenh1024@gmail.com>

raulchen enabled auto-merge (squash) January 7, 2025 18:05

github-actions bot added the go add ONLY when ready to merge, run all tests label Jan 7, 2025

raulchen merged commit 25ca8aa into ray-project:master Jan 7, 2025
6 of 7 checks passed

raulchen deleted the fix-fused-concurrency branch January 7, 2025 21:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[data] Fix map operator fusion when concurrency is set #49573

[data] Fix map operator fusion when concurrency is set #49573

raulchen commented Jan 4, 2025

Jay-ju commented Jan 6, 2025

gvspraveen Jan 6, 2025 •

edited

Loading

raulchen Jan 6, 2025

gvspraveen Jan 6, 2025 •

edited

Loading

raulchen commented Jan 6, 2025

Jay-ju commented Jan 8, 2025

[data] Fix map operator fusion when concurrency is set #49573

[data] Fix map operator fusion when concurrency is set #49573

Conversation

raulchen commented Jan 4, 2025

Why are these changes needed?

Related issue number

Checks

Jay-ju commented Jan 6, 2025

gvspraveen Jan 6, 2025 • edited Loading

Choose a reason for hiding this comment

raulchen Jan 6, 2025

Choose a reason for hiding this comment

gvspraveen Jan 6, 2025 • edited Loading

Choose a reason for hiding this comment

raulchen commented Jan 6, 2025

Jay-ju commented Jan 8, 2025

gvspraveen Jan 6, 2025 •

edited

Loading

gvspraveen Jan 6, 2025 •

edited

Loading