On Optimal Caching and Model Multiplexing for Large Model Inference

Installation

    pip install -e .

Run Experiments

    cd inferband
    python run_exp.py
    python plot.py