Code for paper: Model-based Reinforcement Learning for Continuous Control with Posterior Sampling (https://arxiv.org/abs/2012.09613)
Please see requirements.txt for package dependencies.
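Dependencies can be installed with pip:
pip install -r requirements.txt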
Stochastic Cartpole:
python run_cartpole.py --with-reward True
(with oracle rewards)
python run_cartpole.py --with-reward False
(without oracle rewards)
Stochastic Pendulum:
python run_pendulum.py --with-reward True
(with oracle rewards)
python run_pendulum.py --with-reward False
(without oracle rewards)
Reacher:
python run_reacher.py --with-reward True
(with oracle rewards)
python run_reacher.py --with-reward False
(without oracle rewards)
Pusher:
python run_pusher.py --with-reward True
(with oracle rewards)
python run_pusher.py --with-reward False
(without oracle rewards)
Cumulative rewards for each run are saved to files named envname_log.txt.
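To inspect a learning curve, the log can be plotted with a short script. The sketch below assumes each line of an envname_log.txt file holds one cumulative reward per episode; the filename cartpole_log.txt is illustrative, and the parsing may need adjusting to the actual log format.

import matplotlib.pyplot as plt

# Assumed format: one cumulative reward per line, one line per episode.
with open("cartpole_log.txt") as f:
    rewards = [float(line) for line in f if line.strip()]

plt.plot(rewards)
plt.xlabel("Episode")
plt.ylabel("Cumulative reward")
plt.show()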
If you find the code useful, please cite:
@InProceedings{pmlr-v139-fan21b,
title = {Model-based Reinforcement Learning for Continuous Control with Posterior Sampling},
author = {Fan, Ying and Ming, Yifei},
booktitle = {Proceedings of the 38th International Conference on Machine Learning},
pages = {3078--3087},
year = {2021},
editor = {Meila, Marina and Zhang, Tong},
volume = {139},
series = {Proceedings of Machine Learning Research},
month = {18--24 Jul},
publisher = {PMLR},
pdf = {http://proceedings.mlr.press/v139/fan21b/fan21b.pdf},
url = {https://proceedings.mlr.press/v139/fan21b.html}
}