This is a TensorFlow implementation of the paper "Continuous control with deep reinforcement learning" (DDPG). The code is adapted from MorvanZhou's implementation (linked below) with some improvements. The main differences between MorvanZhou's version and this code are:
- Using `tf.train.ExponentialMovingAverage` to soft-update the target networks instead of `tf.assign` (see the sketch after this list);
- Changing the activation function from `tf.nn.relu` to `tf.nn.elu`;
- Using `tf.train.AdagradOptimizer` to train the neural networks;
- Batch-updating the parameters using accumulated, averaged gradients instead of the mean of the mini-batch.
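Below is a minimal sketch of how the EMA-based soft update can be wired up with the TF1 API. The names `TAU`, `build_net`, and the toy layer sizes are illustrative placeholders, not the actual code in `Train.py`; the idea is that `tf.train.ExponentialMovingAverage` with decay `1 - tau` maintains shadow variables that behave exactly like a soft-updated target network, so no explicit `tf.assign` copies are needed.

```python
import tensorflow as tf

TAU = 0.01  # hypothetical soft-update rate; the repo may use a different value

def build_net(s, custom_getter=None):
    """A toy one-layer network standing in for the actor/critic."""
    with tf.variable_scope('actor', reuse=custom_getter is not None,
                           custom_getter=custom_getter):
        return tf.layers.dense(s, 1, activation=tf.nn.elu, name='l1')

s = tf.placeholder(tf.float32, [None, 3])

# Online (trainable) network.
a_online = build_net(s)
online_vars = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope='actor')

# ExponentialMovingAverage with decay = 1 - tau keeps a shadow copy of each
# variable: shadow <- decay * shadow + (1 - decay) * var, which is exactly
# the DDPG soft update target <- (1 - tau) * target + tau * online.
ema = tf.train.ExponentialMovingAverage(decay=1.0 - TAU)
soft_update_op = ema.apply(online_vars)

# The target network reuses the same layer definitions but reads the EMA
# shadow variables through a custom getter, so no tf.assign copies are needed.
def ema_getter(getter, name, *args, **kwargs):
    return ema.average(getter(name, *args, **kwargs))

a_target = build_net(s, custom_getter=ema_getter)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # After each optimizer step on the online network, run soft_update_op
    # to nudge the target weights toward the online weights.
    sess.run(soft_update_op)
```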
This version runs about 4.0x faster than MorvanZhou's when running on the same desktop computer with a GTX 960 GPU, and achieves a more stable learning curve.
Type `python Train.py` in the terminal to run. Make sure you've already installed TensorFlow and OpenAI Gym.
Any comments or suggestions are welcome; you can open an issue or contact me via fengredrum@gmail.com.
More deep RL algorithms are coming soon!
- Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., ... & Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971.
- https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
- https://github.com/songrotek/DDPG
- https://github.com/RuiShu/micro-projects/tree/master/tf-ema