@NickyDark1, I ran that model in Colab and it works.

Without quantizing:
```python
# Load the model directly
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline

tokenizer = AutoTokenizer.from_pretrained("h2oai/h2o-danube-1.8b-chat")
model = AutoModelForCausalLM.from_pretrained("h2oai/h2o-danube-1.8b-chat")

# "text-generation" is the right pipeline task for a causal LM
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
pipe("Hello, How")
```
Output:

```
[{'generated_text': 'Hello, How are you?\n\n"I\'m doing well, thank you. How about'}]
```
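For reference, a rough way to time the run before and after the swap (the prompt and `max_new_tokens` here are illustrative choices, not exactly what I ran):

```python
import time

start = time.time()
pipe("Hello, How", max_new_tokens=16)  # cap generation length so runs are comparable
print(f"latency: {time.time() - start:.1f}s")
```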
After replacing the Linear layers with bitnet:
```python
from bitnet import replace_linears_in_hf

replace_linears_in_hf(model)

# move the model back to CUDA
model.to("cuda")

pipe_1_bit = pipeline("text-generation", model=model, tokenizer=tokenizer)
pipe_1_bit("Hello, How")
```
The output is:

```
[{'generated_text': 'Hello, How島 waters everyoneürgen Mess till revel馬 Vitt officials ambos">< czł plusieurs ap riv居'}]
```
But it takes ages to produce this answer (8 minutes in my case on free Colab).
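I suspect this is expected behavior rather than a bug. As a sketch of my understanding (this is an assumption about what `replace_linears_in_hf` does, not taken from the bitnet source; `NaiveBitLinear` and `replace_linears` are hypothetical names), the swap recursively replaces every `nn.Linear` with a layer that ternarizes the pretrained weights on each forward pass. The fp16 weights were never trained to survive rounding to {-1, 0, +1}, which would explain the garbage text, and the per-call quantization runs in Python for every linear layer, which would explain the latency:

```python
import torch
import torch.nn as nn

class NaiveBitLinear(nn.Module):
    """Hypothetical 1.58-bit linear: ternary weights scaled by their mean absolute value."""
    def __init__(self, linear: nn.Linear):
        super().__init__()
        self.weight = linear.weight
        self.bias = linear.bias

    def forward(self, x):
        # Quantize on every forward call: round weights to {-1, 0, +1}
        # and rescale. This extra work happens for every token generated.
        scale = self.weight.abs().mean()
        w_q = torch.clamp(torch.round(self.weight / (scale + 1e-8)), -1, 1) * scale
        return nn.functional.linear(x, w_q, self.bias)

def replace_linears(module: nn.Module):
    # Recursively swap every nn.Linear in the module tree.
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            setattr(module, name, NaiveBitLinear(child))
        else:
            replace_linears(child)
```

If that picture is right, getting decent output would require training (or at least fine-tuning) the model with the BitLinear layers in place, the way the BitNet paper trains its models from scratch, rather than swapping layers into an already-trained fp16 checkpoint.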
Upvote & Fund
The text was updated successfully, but these errors were encountered: