Skip to content

Commit

Permalink
Fix typo in 2024-05-15-KServe-0.13-release.md (#376)
Browse files Browse the repository at this point in the history
Signed-off-by: Daniele Zonca <dzonca@redhat.com>
  • Loading branch information
danielezonca authored Jun 19, 2024
1 parent c167afa commit 02b3d6d
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/blog/articles/2024-05-15-KServe-0.13-release.md
Original file line number Diff line number Diff line change
Expand Up @@ -88,7 +88,7 @@ These endpoints are useful for generative transformer models, which take in mess
This update fosters a standardized approach to transformer model serving, ensuring compatibility with a broader spectrum of models and tools, and enhances the platform's versatility. The API can be directly used with OpenAI's client libraries or third-party tools, like LangChain or LlamaIndex.

### Future Plan
* Support other tasks like text embeddings [#3572](https://github.com/kserve/kserve/issues/3572])
* Support other tasks like text embeddings [#3572](https://github.com/kserve/kserve/issues/3572).
* Support more LLM backend options in the future, such as TensorRT-LLM.
* Enrich text generation metrics for Throughput(tokens/sec), TTFT(Time to first token) [#3461](https://github.com/kserve/kserve/issues/3461).
* KEDA integration for token based LLM Autoscaling [#3561](https://github.com/kserve/kserve/issues/3561).
Expand Down

0 comments on commit 02b3d6d

Please sign in to comment.