
Blaizzy / mlx-vlm Public


Releases: Blaizzy/mlx-vlm


v0.1.0

18 Oct 00:15

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


Latest release

What's Changed

Add support for Pixtral-12B by @Blaizzy in #67
Fix pixtral multi-image by @hiima234 in #41
Added: Qwen2-VL Unit Tests, Refactored Weight Sanitization by @benzimring in #63
Trainer + Multi image v0.1.0 by @Blaizzy in #41
Fix example scripts in the readme.md to import and use load_config by @mark-lord in #82
Qwen2-VL Improvements (1-2x speedup) by @Blaizzy in #89
Fix Paligemma object detection and segmentation by @Blaizzy in #90
Add support for Llama-3.2-vision & Resize image by @Blaizzy in #83
Fix idefics-2 mask by @Blaizzy in #91

New Contributors

@benzimring made their first contribution in #63
@mark-lord made their first contribution in #82

Full Changelog: v0.0.15...v0.1.0

Contributors

Blaizzy, benzimring, and 2 other contributors


v0.0.15

29 Sep 00:24

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


What's Changed

Qwen2-VL fix vision tower bug for HD images by @Blaizzy in #62

Full Changelog: v0.0.14...v0.0.15

Contributors

Blaizzy


v0.0.14

28 Sep 16:14

Blaizzy


What's Changed

Add support for Qwen2-VL by @Blaizzy in #59

Full Changelog: v0.0.13...v0.0.14

Contributors

Blaizzy


v0.0.13

16 Aug 20:52

Blaizzy


What's Changed

Fix chat template bug by @Blaizzy in #55

Full Changelog: v0.0.12...v0.0.13

Contributors

Blaizzy


v0.0.12

02 Aug 09:18

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


What's Changed

Refactor KVCache and add stream generate by @Blaizzy in #53
Fix gradio app generation by @Blaizzy in #54

Full Changelog: v0.0.11...v0.0.12

Contributors

Blaizzy


v0.0.11

04 Jul 14:09

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


What's Changed

Add Dolphin-vision and Bunny by @Blaizzy in #50

Full Changelog: v0.0.10...v0.0.11

Contributors

Blaizzy


v0.0.10

24 Jun 16:19

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


What's Changed

Add support for phi-3-vision-128k-instruct by @JosefAlbers in #36

New Contributors

@JosefAlbers made their first contribution in #36

Full Changelog: v0.0.9...v0.0.10

Contributors

JosefAlbers


v0.0.9

22 Jun 10:30

Blaizzy


What's Changed

Add support for Llava-Next (v1.6) by @Blaizzy in #43

Full Changelog: v0.0.8...v0.0.9

Contributors

Blaizzy


v0.0.8

08 Jun 14:10

Blaizzy


What's Changed

Update load_image function to handle BytesIO input by @gabewillen in #34
Add support for DeepSeek-VL by @Blaizzy in #37

New Contributors

@gabewillen made their first contribution in #34

Full Changelog: v0.0.7...v0.0.8

Contributors

Blaizzy and gabewillen


v0.0.7

25 May 19:18

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194


What's Changed

Fix idefics2 OCR by @Blaizzy in #31

Full Changelog: v0.0.6...v0.0.7

Contributors

Blaizzy
