Marwan/summit compute config fixes (#345)
Address the issues with the current compute configs in Ray Summit
Templates after testing in staging.

A prompt turnaround here would be very much appreciated

Summary of changes:
- [worker_node_types instead of worker_nodes](b4ccef3)
- [add head node name](a1c7367)
- [set custom resources correctly](97556ae)
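
The three fixes listed above could be checked mechanically before a template goes to staging. The following is a minimal sketch, not part of this commit: `find_config_issues` and `STANDARD_RESOURCE_KEYS` are hypothetical names, the config is assumed to be already parsed into a plain dict, and the set of standard resource keys is inferred from the diffs in this commit rather than taken from an official schema.

```python
from typing import Any

# Assumption: inferred from the diffs in this commit, not from an official schema.
STANDARD_RESOURCE_KEYS = {"cpu", "gpu", "custom_resources"}


def find_config_issues(config: dict[str, Any]) -> list[str]:
    """Return human-readable problems found in one parsed compute config."""
    issues = []

    # Fix 1: the worker list belongs under `worker_node_types`.
    if "worker_nodes" in config:
        issues.append("rename 'worker_nodes' to 'worker_node_types'")

    head = config.get("head_node_type") or {}

    # Fix 2: the head node needs an explicit name.
    if not head.get("name"):
        issues.append("head_node_type is missing 'name'")

    # Fix 3: non-standard resource keys belong under `custom_resources`.
    resources = head.get("resources") or {}
    for key in sorted(set(resources) - STANDARD_RESOURCE_KEYS):
        issues.append(f"move resource '{key}' under 'custom_resources'")

    return issues
```

Run against a dict that mirrors the pre-fix shape of these configs, the function reports one message per problem; a config that already has all three fixes yields an empty list.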
marwan116 authored Sep 24, 2024
1 parent 32d6ed6 commit c767f0e
Showing 4 changed files with 8 additions and 3 deletions.
3 changes: 2 additions & 1 deletion configs/ray-summit-end-to-end-llms/aws.yaml
@@ -1,10 +1,11 @@
 head_node_type:
+  name: head_node
   instance_type: m5.4xlarge
   resources:
     cpu: 0
     gpu: 0

-worker_nodes:
+worker_node_types:
   - instance_type: m5.4xlarge
     name: 16CPU-64GB
     min_workers: 1
1 change: 1 addition & 0 deletions configs/ray-summit-multi-modal-search/aws.yaml
@@ -1,4 +1,5 @@
 head_node_type:
+  name: head_node
   instance_type: m5.4xlarge
   resources:
     cpu: 0
4 changes: 3 additions & 1 deletion configs/ray-summit-rag/aws.yaml
@@ -1,9 +1,11 @@
 head_node_type:
   instance_type: m5.4xlarge
+  name: head_node
   resources:
     cpu: 0
     gpu: 0
-    is_head_node: 1
+    custom_resources:
+      is_head_node: 1

 worker_node_types:
   - instance_type: g6.4xlarge
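
The custom-resource change in this file moves `is_head_node` from directly under `resources` into a `custom_resources` mapping. A minimal sketch of that migration on an already-parsed node type follows. It is not part of this commit: `nest_custom_resources` is a hypothetical helper, and treating `cpu`, `gpu`, and `custom_resources` as the only standard keys is an assumption drawn from this diff.

```python
import copy
from typing import Any

# Assumption: inferred from this diff, not from an official schema.
STANDARD_RESOURCE_KEYS = {"cpu", "gpu", "custom_resources"}


def nest_custom_resources(node: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of a node type with stray resource keys nested."""
    fixed = copy.deepcopy(node)  # leave the caller's dict untouched
    resources = fixed.get("resources")
    if not resources:
        return fixed

    # Collect the stray keys first so the dict is not mutated while iterating.
    stray = [key for key in resources if key not in STANDARD_RESOURCE_KEYS]
    if stray:
        custom = resources.setdefault("custom_resources", {})
        for key in stray:
            custom[key] = resources.pop(key)
    return fixed
```

Applied to the old head node definition, this should produce the nesting shown in the added lines of the diff while leaving `cpu` and `gpu` where they are.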
3 changes: 2 additions & 1 deletion configs/ray-summit-stable-diffusion/aws.yaml
@@ -1,10 +1,11 @@
 head_node_type:
+  name: head_node
   instance_type: g6.4xlarge
   resources:
     cpu: 12
     gpu: 1

-worker_nodes:
+worker_node_types:
   - instance_type: g6.4xlarge
     name: '1xL4:16CPU-64GB'
     min_workers: 1
