Name		Name	Last commit message	Last commit date
parent directory ..
images		images
Dockerfile		Dockerfile
README.md		README.md
diffusion_utils.py		diffusion_utils.py
entrypoint.sh		entrypoint.sh
generate_request_vertex.py		generate_request_vertex.py
load_weights.py		load_weights.py
main.py		main.py
requirements.txt		requirements.txt
test_container.py		test_container.py
validate_response.py		validate_response.py

README.md

Stable Diffusion

prompt: A woman dressed like the Mexican Holiday Dia de los Muertos

Intro

This repo containerizes stable diffusion using Huggingface's diffusers library into a serving container using fastapi which can be served with Vertex AI prediction.

The model license can be found here.

Features:

Text to image.
Image to image.
Inpainting.
Uses xformers and attention slicing and fp16 to reduce GPU memory.

Setup

Cone repo if you haven't. Navigate to the serving-stable-diffusion folder.

Build container. Change the project-id to yours. Right now model_name only supports models hosted in Huggingface. In the future models from other sources will be supported.

PROJECT_ID=<project-id>
docker build -t gcr.io/$PROJECT_ID/serving-sd:latest --build-arg model_name=runwayml/stable-diffusion-v1-5 --build-arg use_xformers=1 --build-arg model_revision=fp16 .

Run container. You need NVIDIA docker and a GPU.

docker run -p 80:8080 --gpus all -e AIP_HEALTH_ROUTE=/health -e AIP_HTTP_PORT=8080 -e AIP_PREDICT_ROUTE=/predict gcr.io/jfacevedo-demos/serving-sd:latest

Test the container locally.
```
python test_container.py > results.jsonl
```
results.jsonl will contain the response with the generated images.
Validate prediction. This will create an output folder with the generated images from the previous step.
```
python validate_response.py --response-json response.jsonl
```

Deploy in Vertex AI.

You'll need to enable Vertex AI and have authenticated with a service account that has the Vertex AI admin or editor role.

Push the image

gcloud auth configure-docker
docker push gcr.io/$PROJECT_ID/serving-sd:latest

Deploy in Vertex AI prediction.

python ../gcp_deploy.py --image-uri gcr.io/$PROJECT_ID/serving-sd:latest --model-name stable-diffusion --endpoint-name stable-diffusion-endpoint --endpoint-deployed-name stable-diffusion-deployed-name

The last command will display the endpoint name, it should look like projects/611558971877/locations/us-central1/endpoints/3386579376433790976:

Test the endpoint using the endpoint name.
```
python generate_request_vertex.py --endpoint-name projects/611558971877/locations/us-central1/endpoints/3386579376433790976
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

serving-stable-diffusion

serving-stable-diffusion

README.md

Stable Diffusion

Intro

Setup

Deploy in Vertex AI.

Files

serving-stable-diffusion

Directory actions

More options

Directory actions

More options

Latest commit

History

serving-stable-diffusion

Folders and files

parent directory

README.md

Stable Diffusion

Intro

Setup

Deploy in Vertex AI.