From c3d1b52f10cbea52f28d67d9c30c0d07fd018112 Mon Sep 17 00:00:00 2001 From: Yuan Tang Date: Sun, 20 Oct 2024 07:53:53 -0400 Subject: [PATCH] Separate Secret and ISVC YAML for Hugging Face runtime example (#409) * Separate Secret and ISVC YAML for Hugging Face runtime example Signed-off-by: Yuan Tang * Update README.md Signed-off-by: Dan Sun --------- Signed-off-by: Yuan Tang Signed-off-by: Dan Sun Co-authored-by: Dan Sun --- .../v1beta1/llm/huggingface/text_generation/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/docs/modelserving/v1beta1/llm/huggingface/text_generation/README.md b/docs/modelserving/v1beta1/llm/huggingface/text_generation/README.md index 225da1eb3..fd416c278 100644 --- a/docs/modelserving/v1beta1/llm/huggingface/text_generation/README.md +++ b/docs/modelserving/v1beta1/llm/huggingface/text_generation/README.md @@ -13,6 +13,7 @@ If the model is not supported by vLLM, KServe falls back to HuggingFace backend Create a secret with the Hugging Face token. === "Yaml" + ```yaml apiVersion: v1 kind: Secret @@ -23,6 +24,8 @@ Create a secret with the Hugging Face token. HF_TOKEN: ``` +Then create the inference service. + === "Yaml" ```yaml @@ -190,6 +193,7 @@ Create a secret with the Hugging Face token. stringData: HF_TOKEN: ``` +Then create the inference service. === "Yaml"