Skip to content

About The official GitHub page for ''DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning'' Resources

Notifications You must be signed in to change notification settings

txy77/MCTS4ZSICL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning

😀 Overview

  • We are the first to formalize ZS-ICL as a planning problem, which is closer to real-world scenarios.

  • We propose a novel demonstration-aware MCTS for ZS-ICL to achieve a more effective and efficient search for the problem-solving trajectories.

  • Extensive experiments demonstrate the effectiveness of our approach on in-domain and cross-domain scenarios, and it even outperforms ICL using human-annotated demonstrations.

🚀 Quick Start

Requirements

  • python == 3.11.9
  • pytorch == 2.3.1
  • transformers == 4.42.4
  • accelerate == 0.33.0
  • openai==1.35.14

Download Models

Download models from huggingface, open the src/utils.py file and update the directory paths in lines 34-41.

Parameter Settings

  • model: large language models
  • method: method of zero-shot in-context learning: ZS, FS, SelfICL, DAIL, Search
  • dataset: evaluation dataset: bbh, bbh-mini, mmlu
  • shot_num: the shot number of in-context learning
  • select_strategy: the demonstration selection strategy of in-context learning
  • diverse_candidate: the number of retrieved candidates: $k_d$
  • search_strategy: the search strategy of zero-shot in-context learning: Greedy, MC, Beam_Search, MCTS
  • beam_search_expand_num: the expansion number of the beam search
  • select_num: the number of node retained of the beam search
  • expansion_num: the expansion number of MCTS: $k_a$
  • iterative_num: the iteration number of MCTS
  • uct: the type of UCT in the selection phase: random, uct, demo_uct
  • w_exp: $w_a$
  • c_exp: $w_Q$
  • use_cache: whether to use the cache strategy
  • epsilon: $\epsilon$
  • aggregation: whether to use the aggregation strategy
  • calibration: whether to use the calibration strategy

About

About The official GitHub page for ''DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning'' Resources

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages