Skip to content

shah-zeb-naveed/azure-llm-deployments

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

azure-llm-deployments

This repository contains the deployment commands and manifest files for deploying Hugging Face Text Generation Inference (both on Azure Container Instance as well as Azure Kubernetes Service).

For AKS in specific, following components are used:

Azure Resources:

  • Azure Container Registry
  • Azure Cosmos DB (MongoDB vCore)

Frontend:

  • Hugging Face Chat UI

LLM Server:

  • Hugging Face Text Generation Inference

Reverse Proxy:

  • K8S Nginx Ingress Controller (Recommended Method for AKS)
  • Sidecar pattern
    • Nginx
    • Caddy

Full guides published on Medium: https://medium.com/@shahzebnaveed

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages