EntroShape/llm-caching-multiplexing

On Optimal Caching and Model Multiplexing for Large Model Inference

Installation

pip install -e .

Run Experiments

cd inferband
python run_exp.py
python plot.py
