Skip to content

Latest commit

 

History

History
13 lines (9 loc) · 491 Bytes

README.md

File metadata and controls

13 lines (9 loc) · 491 Bytes

Policy Optimization with Demonstrations (POfD)

This repository is a reimplementation of Policy Optimization with Demonstrations (ICML 2018).

Dependencies

This code is highly based on OpenAI baselines gail.

Training

To run POfD on delayed Mujoco tasks:

python baselines/gail/run_mujoco.py --env_id Hopper-v1 --reward-freq 10 --num_epochs 1000