I'd like to know whether bitsandbytes can be decoupled from HuggingFace, or whether the two have to be used together. In addition, is the int4 quantization completed during the get_accelerate_model phase and therefore unrelated to the subsequent training with Trainer? And at what point does dequantization occur? I ask because I've noticed that int4 quantization changes the shape of the linear layer's weight.
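For example, here is a minimal sketch of the shape change I mean, using Linear4bit in plain PyTorch with no HuggingFace code involved (this assumes a recent bitsandbytes release and a CUDA device; the layer sizes are illustrative):

```python
import torch
import bitsandbytes as bnb

layer = bnb.nn.Linear4bit(
    1024, 1024, bias=False,
    compute_dtype=torch.float16,  # dtype the weights are dequantized to for the matmul
    quant_type="nf4",
)
print(layer.weight.shape)   # torch.Size([1024, 1024]) -- still the full-precision weight on CPU

layer = layer.cuda()        # quantization actually runs here, on the move to the GPU
print(layer.weight.shape)   # e.g. torch.Size([524288, 1]) -- packed 4-bit storage
print(layer.weight.dtype)   # torch.uint8: two 4-bit values per byte

x = torch.randn(1, 1024, dtype=torch.float16, device="cuda")
y = layer(x)                # weights are dequantized blockwise inside forward()
print(y.shape)              # torch.Size([1, 1024]) -- the output shape is unchanged
```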
Recently, I finally adapted QLoRA for frameworks other than HuggingFace. The adaptation focuses primarily on the operators in Linear4bit and the series of quantized optimizers. Of course, along the way I ran into quite a lot of dtype errors.
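Roughly, the adaptation looks like the sketch below: a frozen Linear4bit base layer, a trainable LoRA adapter around it, and one of bitsandbytes' quantized optimizers driving a plain PyTorch loop, with Trainer nowhere in sight. The LoRALinear4bit class, the rank, the learning rate, and the dummy loss are all illustrative, not my actual code:

```python
import torch
import bitsandbytes as bnb

class LoRALinear4bit(torch.nn.Module):
    """Frozen 4-bit base weight plus a trainable low-rank adapter (QLoRA-style)."""
    def __init__(self, in_f, out_f, r=8):
        super().__init__()
        self.base = bnb.nn.Linear4bit(
            in_f, out_f, bias=False,
            compute_dtype=torch.float16, quant_type="nf4")
        self.lora_a = torch.nn.Linear(in_f, r, bias=False, dtype=torch.float16)
        self.lora_b = torch.nn.Linear(r, out_f, bias=False, dtype=torch.float16)
        torch.nn.init.zeros_(self.lora_b.weight)  # adapter starts as a no-op

    def forward(self, x):
        return self.base(x) + self.lora_b(self.lora_a(x))

model = LoRALinear4bit(1024, 1024).cuda()  # the base weight is quantized on this move

# Only the adapter parameters train; Linear4bit's packed weight has requires_grad=False.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = bnb.optim.PagedAdamW8bit(trainable, lr=1e-4)

x = torch.randn(4, 1024, dtype=torch.float16, device="cuda")
loss = model(x).float().pow(2).mean()  # dummy loss, just to show the loop runs
loss.backward()                        # grads flow through the frozen 4-bit matmul
optimizer.step()
optimizer.zero_grad()
```

Most of the dtype errors I mentioned come from the boundary between the fp16 activations and the uint8 packed weights inside forward/backward, which is exactly where the dequantization happens.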