Releases: Blaizzy/mlx-vlm
Releases · Blaizzy/mlx-vlm
v0.1.0
What's Changed
- Add support for Pixtral-12B by @Blaizzy in #67
- Fix pixtral multi-image by @hiima234 in #41
- Added: Qwen2-VL Unit Tests, Refactored Weight Sanitization by @benzimring in #63
- Trainer + Multi image v0.1.0 by @Blaizzy in #41
- Fix example scripts in the readme.md to import and use load_config by @mark-lord in #82
- Qwen2-VL Improvements (1-2x speedup) by @Blaizzy in #89
- Fix Paligemma object detection and segmentation by @Blaizzy in #90
- Add support for Llama-3.2-vision & Resize image by @Blaizzy in #83
- Fix idefics-2 mask by @Blaizzy in #91
New Contributors
- @benzimring made their first contribution in #63
- @mark-lord made their first contribution in #82
Full Changelog: v0.0.15...v0.1.0
v0.0.15
v0.0.14
v0.0.13
v0.0.12
v0.0.11
v0.0.10
What's Changed
- Add support for phi-3-vision-128k-instruct by @JosefAlbers in #36
New Contributors
- @JosefAlbers made their first contribution in #36
Full Changelog: v0.0.9...v0.0.10
v0.0.9
v0.0.8
What's Changed
- Update load_image function to handle BytesIO input by @gabewillen in #34
- Add support for DeepSeek-VL by @Blaizzy in #37
New Contributors
- @gabewillen made their first contribution in #34
Full Changelog: v0.0.7...v0.0.8