This repository contains code for aligning GPT-2 (124M) with Direct Preference Optimization (DPO), as well as for supervised fine-tuning (SFT) on the same preference dataset.
Credit to @karpathy for the GPT-2 PyTorch architecture and @graykode for the custom tokenizer.
Setup:

```bash
git clone https://github.com/aalokpatwa/dpo.git
cd dpo
conda create -n dpo_gpt2 python=3.9
conda activate dpo_gpt2
pip install -r requirements.txt
```
Run all scripts from the root directory of the repo.
Training a model:
```bash
python train.py [--dataset] [--results_dir] [--loss] [--batch_size] [--epochs] [--lr] [--beta]
```

Example:

```bash
python train.py --loss dpop --epochs 2 --beta 0.5
```
- `--dataset` should point to the JSON file containing the preference data (a hedged sketch of one possible layout is shown below).
- `--loss` can be one of four options: `dpo`, `dpop`, `sft`, or `kl_sft` (see the loss sketch after the dataset example).
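The exact schema is defined by this repo's data-loading code, but a DPO preference dataset typically pairs each prompt with a chosen and a rejected completion. A minimal sketch, assuming hypothetical field names `prompt`, `chosen`, and `rejected`:

```python
# Hypothetical layout of a preference-data record; the field names here are
# assumptions, so check the repo's data-loading code for the real schema.
import json

example_records = [
    {
        "prompt": "Explain what a transformer is.",
        "chosen": "A transformer is a neural network architecture built on self-attention.",
        "rejected": "A transformer converts voltage levels in power grids.",
    }
]

# Write a toy dataset file in this assumed format.
with open("example.json", "w") as f:
    json.dump(example_records, f, indent=2)
```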
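For orientation, here is a minimal sketch of the standard DPO objective from Rafailov et al. (2023). It is an illustration, not the code in `dpo/loss.py`; the argument names, shapes, and default `beta` are assumptions. DPOP (DPO-Positive) extends this objective with a penalty that discourages the policy's log-probability of the chosen completion from falling below the reference model's.

```python
# Minimal sketch of the DPO loss. Illustrative only, not this repo's dpo/loss.py.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Each argument is a 1-D tensor of summed token log-probs, one per example."""
    # Implicit rewards: beta-scaled log-ratio of the policy to the frozen reference.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the margin between chosen and rejected implicit rewards.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```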
After a model is trained, you can sample completions from a test dataset and write the results to a CSV:
```bash
python generate_completions.py [--dataset] [--model] [--results_dir]
```

Example:

```bash
python generate_completions.py --dataset dataset/upenn_test.json --model dpop
```
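To spot-check the output before running the GPT-4 evaluation, you can load the CSV directly; a quick sketch, assuming pandas is installed (the column names are whatever `generate_completions.py` actually writes):

```python
# Sanity-check the generated completions before evaluation.
import pandas as pd

df = pd.read_csv("results/dpop_results.csv")
print(df.columns.tolist())  # inspect the actual schema first
print(df.head())
```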
After this, if you would like to evaluate the generations using GPT-4, create a `.env` file in the repo root and add your OpenAI API key.
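The variable name below is an assumption (the conventional `OPENAI_API_KEY`); check the evaluation scripts for the name they actually read:

```
OPENAI_API_KEY=<your-openai-api-key>
```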
Then, you can run:

```bash
python alignment_accuracies.py [--results_file]
python win_rates.py [--results_files]
```
Example:

```bash
python alignment_accuracies.py --results_file results/dpop_results.csv
python win_rates.py --results_files results/dpo_results.csv,results/dpop_results.csv,results/hf_results.csv
```
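Conceptually, a win rate is the fraction of head-to-head comparisons a model wins. `win_rates.py` uses GPT-4 as the judge; the sketch below is illustrative only and assumes the judge's per-prompt picks have already been collected into a list:

```python
# Illustrative win-rate computation over hypothetical judge verdicts.
from collections import Counter

judge_picks = ["dpo", "dpop", "dpop", "hf", "dpop"]  # hypothetical per-prompt winners
counts = Counter(judge_picks)
total = len(judge_picks)
for model, wins in sorted(counts.items()):
    print(f"{model}: {wins}/{total} = {wins / total:.0%}")
```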
The training loop is in `train.py`, the model architecture is in `dpo/model.py`, and the loss functions are in `dpo/loss.py`.