Rereading-LLM-Reasoning EMNLP 2024 Main

⭐️ We are honored that Re2 is added into optillm, a repo to optimize LLMs' inference.

⭐️ The official code for the paper EMNLP 2024 "Re-reading improves reasoning in large language models".

Idea: Simply repeating the question to get "bidirectional" understanding for improving reasoning.

Requirements

pip install -r requirements.txt

Set your OpenAI Key:

export OPENAI_API_KEY=your_openai_key

Run

Run gpt-3.5-turbo-0613 on a dataset to test the code:

sh run_single_dataset.sh

Run gpt-3.5-turbo-0613 on all datasets:

sh run.sh

The parameter --multithread could be added to run the code in parallel to speed up the process a lot.

python main.py --dataset gsm --temperature 0.0 --acts vanilla cot --model gpt-3.5-turbo-0613 --read_times 2  --multithread

Run Self-Consistency Experiments:

python main.py --dataset gsm --temperature 0.7 --multithread --acts vanilla --num_threads 40 --majority_at 10 
python main.py --dataset gsm --temperature 0.7 --multithread --acts vanilla --num_threads 40 --majority_at 10  --read_times 2

Run Re2 on PAL and Plan-and-Solve prompting methods:

python main.py --dataset gsm --temperature 0.0 --multithread --acts ps pal --num_threads 40 
python main.py --dataset gsm --temperature 0.0 --verbose --debug --acts ps pal --num_threads 40 --read_times 2

Evaluation

Evaluation will be done automatically after the generation. Or you could run the evaluation code manually:

python eval.py --dataset gsm  --acts vanilla cot --eval_file  results/gsm_gpt-4o-mini-2024-07-18_topp1.0_temp0.0_majority1_readtimes1_20240926_221724.jsonl

Results Logs

Some results logs are in the results folder.

GPT-4o-mini + Re2

We also conduct experiments on GPT-4o-mini. Change the --model parameter to gpt-4o-mini-2024-07-18 to run the experiments on GPT-4o-mini. The results are shown as follows:

Citation

If you find this work helpful, please consider citing our paper:

@inproceedings{xu-etal-2024-reading,
    title = "Re-Reading Improves Reasoning in Large Language Models",
    author = "Xu, Xiaohan  and
      Tao, Chongyang  and
      Shen, Tao  and
      Xu, Can  and
      Xu, Hongbo  and
      Long, Guodong  and
      Lou, Jian-Guang  and
      Ma, Shuai",
    editor = "Al-Onaizan, Yaser  and
      Bansal, Mohit  and
      Chen, Yun-Nung",
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.emnlp-main.871",
    pages = "15549--15575"
}

Acknowledgement

We refer to the code of PAL. Thanks for their contributions.

Modification: Implemented the Re2 method based on the PAL codebase.
Date: March 2024

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
__pycache__		__pycache__
pal		pal
results		results
scripts		scripts
utils		utils
.DS_Store		.DS_Store
LICENSE.txt		LICENSE.txt
README.md		README.md
eval.py		eval.py
gpt-4o-mini.jpg		gpt-4o-mini.jpg
intro.jpg		intro.jpg
main.py		main.py
prompter.py		prompter.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rereading-LLM-Reasoning EMNLP 2024 Main

Requirements

Run

Evaluation

Results Logs

GPT-4o-mini + Re2

Citation

Acknowledgement

About

Releases

Packages

Languages

License

Tebmer/Rereading-LLM-Reasoning

Folders and files

Latest commit

History

Repository files navigation

Rereading-LLM-Reasoning EMNLP 2024 Main

Requirements

Run

Evaluation

Results Logs

GPT-4o-mini + Re2

Citation

Acknowledgement

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages