
Models to port to MLX-VLM #39

Open
10 of 26 tasks
Blaizzy opened this issue Jun 11, 2024 · 20 comments
Labels
good first issue Good for newcomers

Comments

@Blaizzy
Owner

Blaizzy commented Jun 11, 2024

  • MiniCPM-Llama3-V-2_5
  • Florence 2
  • Phi-3-vision
  • Bunny
  • Dolphin-vision-72b
  • Llava Next
  • Qwen2-VL
  • Pixtral
  • Llama-3.2
  • Llava Interleave
  • Idefics 3
  • OmniParser
  • Llava onevision
  • internlm-xcomposer2d5-7b
  • InternVL
  • CogVLM2
  • ColPali
  • MoonDream2
  • Yi-VL
  • CuMo
  • Kosmos-2.5
  • Molmo
  • Ovis Gemma
  • Aria
  • NVIDIA NVLM
  • GOT

Instructions:

  1. Select a model and comment below with your selection.
  2. Create a draft PR titled "Add support for X".
  3. Read the Contribution guide.
  4. Check the existing models.
  5. Tag @Blaizzy for code reviews and questions.

If the model you want is not listed, please suggest it and I will add it.

@Blaizzy
Owner Author

Blaizzy commented Jun 22, 2024

Next release of Llava-Next

TODO:
Update the text config defaults to avoid errors with Llava-v1.6-vicuna:

from dataclasses import dataclass
from typing import Dict, Optional, Union

@dataclass
class TextConfig:
    model_type: str
    hidden_size: int = 4096
    num_hidden_layers: int = 32
    intermediate_size: int = 11008
    num_attention_heads: int = 32
    rms_norm_eps: float = 1e-05
    vocab_size: int = 32064
    num_key_value_heads: int = 32
    rope_theta: float = 1000000
    rope_traditional: bool = False
    rope_scaling: Optional[Dict[str, Union[float, str]]] = None
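To sketch why these defaults help: when a checkpoint's config.json omits some of these keys (or carries extra ones), a from_dict-style constructor can filter unknown keys and fall back on the dataclass defaults. The `from_dict` helper and the trimmed field list below are illustrative, not the repo's actual API:

```python
from dataclasses import dataclass, fields

@dataclass
class TextConfig:
    model_type: str
    hidden_size: int = 4096
    num_hidden_layers: int = 32
    num_key_value_heads: int = 32

    @classmethod
    def from_dict(cls, params: dict) -> "TextConfig":
        # Keep only the keys the dataclass knows about;
        # the defaults above cover anything the config omits.
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in params.items() if k in known})

# A config that omits num_key_value_heads and carries an extra key
cfg = TextConfig.from_dict(
    {"model_type": "llama", "hidden_size": 5120, "_name_or_path": "ignored"}
)
```

With this pattern, a vicuna-style config that leaves out a field no longer raises a TypeError at construction time.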

@BoltzmannEntropy

Thanks for the great repo. This should also be on the list: https://github.com/THUDM/CogVLM2
I'm reading the code now and trying to free up some time for the conversion routine.

@jrp2014

jrp2014 commented Aug 8, 2024

@Blaizzy
Owner Author

Blaizzy commented Aug 8, 2024

Hey @BoltzmannEntropy and @jrp2014,

Thanks for the suggestions!

I've added them to the backlog.

@jrp2014

jrp2014 commented Aug 27, 2024

MiniCPM-V v2.6


@s-smits

s-smits commented Sep 7, 2024

Do you have a link to Florence-2?

@ChristianWeyer

Is the list above the definitive, up-to-date list of supported models, @Blaizzy? Thanks for your hard work!

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

Hey @ChristianWeyer
It's mostly up-to-date, just missing Qwen2-VL.

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

@s-smits here you go:

https://huggingface.co/microsoft/Florence-2-large/blob/main/modeling_florence2.py

@ChristianWeyer

[x] Phi-3-vision

Thanks!
I guess Phi-3-vision includes 3.5?

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

Yes, they share the same architecture, so no changes are needed :)

@pulkitjindal88

Hey @Blaizzy, thanks for this great framework. Is there any priority for InternVL? I can see it is on your list; I just wanted to know whether it is planned for the near term. I want to run the model on my MacBook, and mlx-vlm looks like the best way to do that.

@chigkim

chigkim commented Sep 21, 2024

Qwen2-VL-72B would be amazing!

@simonw

simonw commented Sep 29, 2024

This recipe seems to work for Qwen2-VL-2B-Instruct:

python -m mlx_vlm.generate \
  --model Qwen/Qwen2-VL-2B-Instruct \
  --max-tokens 100 \
  --temp 0.0 \
  --image django-roadmap.png \
  --prompt "Describe image in detail, include all text"

My results here: https://gist.github.com/simonw/9e02d425cacb902260ec1307e0671e17

@chigkim

chigkim commented Sep 30, 2024

Yep, they just merged Qwen2-VL support this weekend.

@xSNYPSx

xSNYPSx commented Oct 2, 2024

Molmo please

@chigkim

chigkim commented Oct 2, 2024

NVIDIA just dropped the multimodal NVLM-D-72B. The benchmarks look pretty good.

https://huggingface.co/nvidia/NVLM-D-72B

@Blaizzy
Owner Author

Blaizzy commented Oct 2, 2024

Yap, that's a pretty awesome model!
It's on my radar because we can run it in 4-bit quantization.
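For a rough sense of why 4-bit matters for a 72B-parameter model, here is some back-of-envelope arithmetic (weights only; activations and KV cache are ignored, and 72e9 parameters is assumed):

```python
# Approximate weights-only memory for a 72B-parameter model
# at different precisions. Illustrative arithmetic only.
PARAMS = 72e9

def weight_gib(bytes_per_param: float, params: float = PARAMS) -> float:
    """Weights-only footprint in GiB; ignores activations and KV cache."""
    return params * bytes_per_param / 1024**3

for name, bpp in [("fp16", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
    print(f"{name}: ~{weight_gib(bpp):.0f} GiB")
```

At fp16 the weights alone are around 134 GiB, while a 4-bit quant brings that down to roughly 34 GiB, which is what puts a model this size within reach of high-memory Apple Silicon machines.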

@chigkim

chigkim commented Oct 25, 2024

Pixtral-12B now has a base model.
https://huggingface.co/mistralai/Pixtral-12B-Base-2409
