v0.15.3
What's new in 0.15.3 (2024-09-30)
These are the changes in inference v0.15.3.
New features
- Feat: Support jina-embedding-v3 by @amumu96 in #2379
- FEAT: Support deepcache with sd models by @frostyplanet in #2313
- FEAT: support minicpm-reranker model by @hwzhuhao in #2383
- FEAT: add vllm restart check and support internvl multi-image chat by @amumu96 in #2384
Bug fixes
- BUG: [UI] Fix 'Model Format' bug on model registration page. by @yiboyasss in #2353
- BUG: Fix default value of max_model_len for vLLM backend. by @zjuyzj in #2385
New Contributors
Full Changelog: v0.15.2...v0.15.3