Skip to content

0.2.3

Latest
Compare
Choose a tag to compare
@lucasnewman lucasnewman released this 13 Dec 18:03
· 1 commit to main since this release

What's Changed

Added support for quantized models. 4-bit and 8-bit are supported using the --q flag.

New Contributors

Full Changelog: 0.2.2...0.2.3