Skip to content

Commit

Permalink
Update READMEs with better cluster create instructions
Browse files Browse the repository at this point in the history
  • Loading branch information
imreddy13 committed Mar 6, 2024
1 parent 9799818 commit 208b108
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 18 deletions.
20 changes: 3 additions & 17 deletions applications/rag/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ CLUSTER_REGION=us-central1
```
2. Use the following instructions to create a GKE cluster. We recommend using Autopilot for a simpler setup.

##### Autopilot (recommended)
##### Autopilot

RAG requires the latest Autopilot features, available on GKE cluster version `1.29.1-gke.1575000`+
```
Expand All @@ -46,23 +46,9 @@ gcloud container clusters create-auto ${CLUSTER_NAME:?} \
--cluster-version ${CLUSTER_VERSION:?}
```

##### Standard
##### Standard (recommended)

1. To create a GKE Standard cluster using Terraform, please follow the [instructions here](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/main/infrastructure/README.md).

TODO: Add GKE cluster requirements for a successful installation.

2. The inference server requires L4 GPUs. Create an additional node pool:
```
gcloud container node-pools create g2-standard-24 --cluster ${CLUSTER_NAME:?} \
--accelerator type=nvidia-l4,count=2,gpu-driver-version=latest \
--machine-type g2-standard-24 \
--ephemeral-storage-local-ssd=count=2 \
--enable-image-streaming \
--num-nodes=1 --min-nodes=1 --max-nodes=2 \
--node-locations ${CLUSTER_REGION:?}-a,${CLUSTER_REGION:?}-b \
--location=${CLUSTER_REGION:?}
```
1. To create a GKE Standard cluster using Terraform, follow the [instructions here](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/main/infrastructure/README.md). Use the preconfigured node pools in `/infrastructure/platform.tfvars` as this solution requires T4s and L4s.

#### Setup Components

Expand Down
4 changes: 3 additions & 1 deletion infrastructure/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,9 @@

Platform module (to be renamed to Infra), creates the GKE cluster & other related resources for the AI applications / workloads to be deployed on them.

Update the ```platform.tfvars``` file with the required configuration. Kindly refer to ```tfvars_examples``` for sample configuration.
1) Update the ```platform.tfvars``` file with the required configuration. Kindly refer to ```tfvars_examples``` for sample configuration.

2) Run `terraform init` and `terraform apply --var-file=platform.tfvars`


## Prerequisites
Expand Down

0 comments on commit 208b108

Please sign in to comment.