v0.15.4
What's new in 0.15.4 (2024-10-12)
These are the changes in inference v0.15.4.
New features
- FEAT: Llama 3.1 Instruct supports tool calls by @codingl2k1 in #2388
- FEAT: Qwen2.5 Instruct supports tool calls by @codingl2k1 in #2393
- FEAT: add whisper-large-v3-turbo audio model by @hwzhuhao in #2409
- FEAT: Add an environment variable to increase the number of retry attempts after model download failures by @hwzhuhao in #2411
- FEAT: support getting progress for image models by @qinxuye in #2395
- FEAT: support the vLLM engine for qwenvl2 by @amumu96 in #2428
Enhancements
- ENH: Launch the ChatTTS model with kwargs by @codingl2k1 in #2425
- REF: refactor ControlNet for image models by @qinxuye in #2346
Bug fixes
- BUG: Pin ChatTTS<0.2 by @codingl2k1 in #2419
- BUG: tool call streaming output contains a duplicated list by @ChengjieLi28 in #2416
Full Changelog: v0.15.3...v0.15.4