Fine-tuning Open-source LLaMa with QLoRa Scripts Repository

This repository contains all scripts and resources used in the article 'How to fine-tune an open-source LLaMa using QLoRa'.

Install pytorch

conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia

Install GPTQ-for-LLaMa

mkdir repositories
cd repositories
git clone https://github.com/qwopqwop200/GPTQ-for-LLaMa.git -b cuda

cd GPTQ-for-LLaMa
pip install -r requirements.txt
python setup_cuda.py install

Installation QLoRa dependencies

In the root directory

pip install -U -r requirements.txt

Fine-tune the model

Update the dataset configuration based on your data format here -> https://github.com/mzbac/qlora-fine-tune/blob/main/qlora.py#L521-L527

python qlora.py --model_name_or_path TheBloke/wizardLM-13B-1.0-fp16 --dataset my-data --bf16

Inference with LoRa adapters

python inference.py

Note: Change the model_name and adapters_name accordingly

Merge LoRa adapters back to base model

python merge_peft_adapters.py --device cpu --base_model_name_or_path TheBloke/wizardLM-13B-1.0-fp16 --peft_model_path ./output/checkpoint-2250/adapter_model --output_dir ./merged_models/

Quantization Model

python repositories/GPTQ-for-LLaMa/llama.py ${MODEL_DIR} c4 --wbits 4 --true-sequential --groupsize 128 --save_safetensors {your-model-name}-no-act-order-4bit-128g.safetensors

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
convert-chatgpt-conversation.py		convert-chatgpt-conversation.py
inference.py		inference.py
merge_peft_adapters.py		merge_peft_adapters.py
qlora.py		qlora.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-tuning Open-source LLaMa with QLoRa Scripts Repository

Install pytorch

Install GPTQ-for-LLaMa

Installation QLoRa dependencies

Fine-tune the model

Inference with LoRa adapters

Merge LoRa adapters back to base model

Quantization Model

About

Releases

Packages

Languages

mzbac/qlora-fine-tune

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning Open-source LLaMa with QLoRa Scripts Repository

Install pytorch

Install GPTQ-for-LLaMa

Installation QLoRa dependencies

Fine-tune the model

Inference with LoRa adapters

Merge LoRa adapters back to base model

Quantization Model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages