OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
reinforcement-learning deep-learning algorithms openai-gym policy policy-gradient machine-learning-engineering trpo proximal-policy-optimization ppo self-play dota2-bot openai-five
-
Updated
Jun 20, 2018 - Python