Skip to content

v0.4.0

Latest
Compare
Choose a tag to compare
@ajindal1 ajindal1 released this 22 Aug 20:26
b77e768

Release Notes

  • Support for new models such as Qwen 2, LLaMA 3.1, Gemma 2, Phi-3 small on CPU
  • Support to build already-quantized models that were quantized with AWQ or GPTQ
  • Performance improvements for Intel and Arm CPU
  • Packing and language binding
    • Added Java bindings (build from source)
    • Separate OnnxRuntime.dll and directml.dll out of GenAI package to improve usability
    • Publish packages for Win Arm
    • Support for Android (build from source)