Skip to content

v0.15.4

Compare
Choose a tag to compare
@XprobeBot XprobeBot released this 12 Oct 10:38
· 75 commits to main since this release
c0be115

What's new in 0.15.4 (2024-10-12)

These are the changes in inference v0.15.4.

New features

  • FEAT: Llama 3.1 Instruct support tool call by @codingl2k1 in #2388
  • FEAT: qwen2.5 instruct tool call by @codingl2k1 in #2393
  • FEAT: add whisper-large-v3-turbo audio model by @hwzhuhao in #2409
  • FEAT: Add environment variable setting to increase the retry attempts after model download failures by @hwzhuhao in #2411
  • FEAT: support getting progress for image model by @qinxuye in #2395
  • FEAT: support qwenvl2 vllm engine by @amumu96 in #2428

Enhancements

Bug fixes

Full Changelog: v0.15.3...v0.15.4