Generative Adversarial Imitation Learning

Implementation of Generative Adversarial Imitation Learning(GAIL) using tensorflow

Dependencies

python>=3.5
tensorflow>=1.4
gym>=0.9.3

Gym environment

Env==CartPole-v0
State==Continuous
Action==Discrete

Usage

Train experts

python3 run_ppo.py

Sample trajectory using expert

python3 sample_trajectory.py

Run GAIL

python3 run_gail.py

Run supervised learning

python3 run_behavior_clone.py

Test trained policy

python3 test_policy.py

Default policy is trained with gail
--alg=bc or ppo allows you to change test policy

If you want to test bc policy, specify the number of model.ckpt-number in the directory trained_models/bc
Example

python3 test_policy.py --alg=bc --model=1000

Tensorboard

tensorboard --logdir=log

Results


Fig.1 Training results	legend

LICENSE

MIT LICENSE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Generative Adversarial Imitation Learning

Dependencies

Gym environment

Usage

Results

LICENSE

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
algo		algo
images		images
log		log
network_models		network_models
trained_models		trained_models
trajectory		trajectory
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
run_behavior_clone.py		run_behavior_clone.py
run_gail.py		run_gail.py
run_ppo.py		run_ppo.py
sample_trajectory.py		sample_trajectory.py
test_policy.py		test_policy.py

License

uidilr/gail_ppo_tf

Folders and files

Latest commit

History

Repository files navigation

Generative Adversarial Imitation Learning

Dependencies

Gym environment

Usage

Results

LICENSE

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages