Apply REINFORCE algorithm
Updated Feb 28, 2023 - Jupyter Notebook
TensorFlow implementation of Proximal Policy Optimization (PPO), a reinforcement learning algorithm, and its common optimizations. Features TensorBoard integration and many sample runs on custom, classic, and robotics-oriented environments.