conda env create -f environment.yml
The experiments in this step must be run before the code for the optimal R-D curves.
Navigate to the toy_dataset folder.
Launch the training runs to fine-tune the models with
bash finetune.sh
Now gather the prompt compression results by issuing
bash inference.sh
Navigate to the nlp_dataset/LLMLingua/experiments/llmlingua2/data_collection folder and run
bash collect_data.sh
to form the dataset for training the QuerySelect and Adaptive QuerySelect encoder models. Then, launch the training script in nlp_dataset/LLMLingua/experiments/llmlingua2/model_training with
bash train.sh
Alternatively, the trained model weights are available on Hugging Face; a download sketch is shown below.
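As a minimal sketch of fetching the weights with the huggingface-cli instead of running train.sh (ORG/MODEL_NAME is a placeholder for the actual Hugging Face repository id, and checkpoints/encoder is an arbitrary local directory):

```bash
# Sketch: download the trained encoder weights instead of running train.sh.
# ORG/MODEL_NAME is a placeholder; substitute the actual Hugging Face repository id.
pip install -U "huggingface_hub[cli]"
huggingface-cli download ORG/MODEL_NAME --local-dir checkpoints/encoder
```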
Navigate to the nlp_dataset folder.
Run the experiments for the prompt compression methods with
bash inference.sh
Experiments for NarrativeQA can be run with
bash narrativeqa.sh
However, running narrativeqa.sh will take an especially long time; the beam search job alone takes approximately 1200 A100 GPU hours. It is recommended to scatter the jobs across multiple GPUs, for example as sketched below.
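A minimal pattern for spreading independent jobs over GPUs (job_0.sh and job_1.sh are hypothetical stand-ins for pieces you split out of narrativeqa.sh; adapt the split to your hardware):

```bash
# Sketch: pin each independent job to its own GPU and run them in parallel.
# job_0.sh / job_1.sh are placeholders for per-method chunks split out of narrativeqa.sh.
CUDA_VISIBLE_DEVICES=0 bash job_0.sh > job_0.log 2>&1 &
CUDA_VISIBLE_DEVICES=1 bash job_1.sh > job_1.log 2>&1 &
wait  # block until all background jobs have finished
```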
Navigate to the optimal_rd folder.
The file RD_dual_LP_solver.py can be called with the following arguments:
python RD_dual_LP_solver.py [--data_path DATA_PATH] [--condition CONDITION] [--rate_vals RATE_VALS] [--wandb_project WANDB_PROJECT]
- DATA_PATH is the name of the .jsonl file from the 'datasets' folder with the data needed to compute the distortion-rate function (default is 'Mistral-7B-Instruct-v0.2_optimal.jsonl')
- CONDITION can take 4 values:
  - 0: computes the query-agnostic curve (default)
  - 1: computes the average query-aware curve
  - 2: computes the conditional query-aware curves (one for each query)
  - 3: does all of the above and stores the results in the file 'output_RD/optimal_RD_<DATA_PATH>.json'
- RATE_VALS is the list of rate values at which the distortion-rate curve is to be plotted
- WANDB_PROJECT is the name of the project if using wandb (default is 'rd-lp-dual-eval')
Examples:
- To compute the query-agnostic optimal curve for the file 'Mistral-7B-Instruct-v0.2_optimal.jsonl' at certain rate values, run
  python RD_dual_LP_solver.py --data_path 'Mistral-7B-Instruct-v0.2_optimal.jsonl' --condition 0 --rate_vals 0.12 0.13 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95
- To compute the conditional query-aware optimal curves for the file 'Mistral-7B-Instruct-v0.2_optimal_forced_ft.jsonl' at certain rate values, run
  python RD_dual_LP_solver.py --data_path 'Mistral-7B-Instruct-v0.2_optimal_forced_ft.jsonl' --condition 2 --rate_vals 0.12 0.13 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95
- To obtain all optimal curves for the file 'Mistral-7B-Instruct-v0.2_accuracy_optimal_ft.jsonl' at certain rate values, run
  python RD_dual_LP_solver.py --data_path 'Mistral-7B-Instruct-v0.2_accuracy_optimal_ft.jsonl' --condition 3 --rate_vals 0.12 0.13 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95
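To run the full sweep (--condition 3) over all three dataset files from the examples above at the same rate grid, a small helper loop such as the following can be used (it assumes those three .jsonl files are the ones present in the 'datasets' folder):

```bash
# Sketch: run the solver with --condition 3 on each dataset file at a shared rate grid.
RATES="0.12 0.13 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95"
for DATA in Mistral-7B-Instruct-v0.2_optimal.jsonl \
            Mistral-7B-Instruct-v0.2_optimal_forced_ft.jsonl \
            Mistral-7B-Instruct-v0.2_accuracy_optimal_ft.jsonl; do
  # $RATES is left unquoted so each rate value is passed as a separate argument.
  python RD_dual_LP_solver.py --data_path "$DATA" --condition 3 --rate_vals $RATES
done
```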