Skip to content

v0.0.106

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 15 Jan 00:40
  1. Remove the VRAM occupation when zero offloading: -ngl 0;
  2. Fix rerank model loading error: gpustack/gte-multilingual-reranker-base-GGUF, gpustack/jina-reranker-v2-base-multilingual-GGUF
  3. Support tool calling in ChatGLM4 series;
  4. Introduce DDIM(ddim_trailing) sample method;
  5. Support multiple devices offloading image model.

image
image