GitHub - mila-iqia/Conscious-Planning: Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

By Mingde "Harry" Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup and Yoshua Bengio

(BLOGPOST)

Install Dependencies

pip install -r requirements.txt

Reproducing Results

CP

python run_distshift_randomized_mp.py --method DQN_CP --num_explorers 8 --ignore_model 0 --disable_bottleneck 0 --size_bottleneck 8

UP

python run_distshift_randomized_mp.py --method DQN_CP --num_explorers 8 --ignore_model 0 --disable_bottleneck 1

WM

python run_distshift_randomized_mp.py --method DQN_WM --num_explorers 8 --ignore_model 0 --disable_bottleneck 0 --size_bottleneck 8 --period_warmup 1000000

Dyna

python run_distshift_randomized_mp.py --prioritized_replay 0 --method DQN_Dyna --num_explorers 8 --ignore_model 0 --disable_bottleneck 0 --size_bottleneck 8 --learn_dyna_model 1

Special thanks to my colleague and friend Safa Alver (@alversafa) for pointing out that Dyna should not use a prioritized replay buffer, since it should not prioritize on errors generated by potentially inaccurate imagined transitions, and for reporting the runtime bugs surrounding this matter!
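To illustrate why prioritization can interact badly with imagined transitions, here is a minimal sketch that is not taken from this repository. It assumes the common proportional scheme in which a transition is sampled with probability proportional to its absolute TD error; the function name, the `alpha` and `eps` parameters, and all error values are hypothetical.

```python
def sampling_probabilities(td_errors, alpha=1.0, eps=1e-6):
    """Proportional prioritization: P(i) is proportional to (|delta_i| + eps) ** alpha."""
    priorities = [(abs(d) + eps) ** alpha for d in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


# Hypothetical TD errors: three real transitions, plus one imagined transition
# whose error is large only because the learned model is inaccurate.
real_errors = [0.1, 0.2, 0.1]
imagined_errors = [3.6]

probs = sampling_probabilities(real_errors + imagined_errors)

# Share of samples drawn from imagined transitions under prioritization ...
imagined_share = sum(probs[len(real_errors):])
# ... versus under uniform sampling (what --prioritized_replay 0 selects).
uniform_share = len(imagined_errors) / (len(real_errors) + len(imagined_errors))

print(f"prioritized: {imagined_share:.2f}, uniform: {uniform_share:.2f}")
```

With these made-up numbers, the single imagined transition receives most of the sampling mass under prioritization, while uniform sampling gives it only its proportional share.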

Dyna*

python run_distshift_randomized_mp.py --method DQN_Dyna --num_explorers 8 --ignore_model 0 --disable_bottleneck 0 --size_bottleneck 8 --learn_dyna_model 0

NOSET

python run_distshift_randomized_mp.py --method DQN_NOSET --num_explorers 8 --ignore_model 0 --layers_model 2 --len_hidden 256
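The commands above differ only in their flags, so they can be generated from a small table. The following is a rough sketch of such a helper; `CONFIGS` and `build_command` are hypothetical names that are not part of this repository, and only flags that appear in the commands above are used.

```python
import shlex

SCRIPT = "run_distshift_randomized_mp.py"

# Flag values copied from the commands listed in this README.
CONFIGS = {
    "CP": {"method": "DQN_CP", "num_explorers": 8, "ignore_model": 0,
           "disable_bottleneck": 0, "size_bottleneck": 8},
    "UP": {"method": "DQN_CP", "num_explorers": 8, "ignore_model": 0,
           "disable_bottleneck": 1},
    "WM": {"method": "DQN_WM", "num_explorers": 8, "ignore_model": 0,
           "disable_bottleneck": 0, "size_bottleneck": 8,
           "period_warmup": 1000000},
    "Dyna": {"prioritized_replay": 0, "method": "DQN_Dyna", "num_explorers": 8,
             "ignore_model": 0, "disable_bottleneck": 0, "size_bottleneck": 8,
             "learn_dyna_model": 1},
    "Dyna*": {"method": "DQN_Dyna", "num_explorers": 8, "ignore_model": 0,
              "disable_bottleneck": 0, "size_bottleneck": 8,
              "learn_dyna_model": 0},
    "NOSET": {"method": "DQN_NOSET", "num_explorers": 8, "ignore_model": 0,
              "layers_model": 2, "len_hidden": 256},
}


def build_command(name, overrides=None):
    """Return the argument list for one variant, with optional flag overrides."""
    args = dict(CONFIGS[name])
    args.update(overrides or {})
    cmd = ["python", SCRIPT]
    for flag, value in args.items():
        cmd += [f"--{flag}", str(value)]
    return cmd


# Print every command; pass a list to subprocess.run(...) to launch it instead.
for variant in CONFIGS:
    print(shlex.join(build_command(variant)))
```

For example, `build_command("CP", {"size_bottleneck": 4})` would produce the CP command with a different bottleneck size, assuming the script accepts that value.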

Changing Settings

Read run_distshift_randomized_mp.py!

Repository Files

.gitignore
CP_Poster.png
DQN_CP.py
DQN_Dyna.py
DQN_NOSET.py
DQN_WM.py
KeyRandDistShift.py
LICENSE
README.md
RandDistShift.py
RandDistShift3.py
components.py
components_CP.py
components_NOSET.py
requirements.txt
run_distshift.py
run_distshift_randomized.py
run_distshift_randomized_mp.py
runtime.py
tree_search.py
utils.py
utils_mp.py
utils_mp_dyna.py
