antonio-f / Dynamic-Programming (Star 12)
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration. From Udacity's Deep Reinforcement Learning Nanodegree program.
Topics: reinforcement-learning openai-gym gym dynamic-programming policy-evaluation policy-iteration value-iteration bellman-equation frozenlake policy-improvement state-value-function action-value-function
Updated Apr 3, 2019. Jupyter Notebook.
antonio-f / MonteCarlo-methods (Star 2)
Monte Carlo methods for Reinforcement Learning (from Udacity's "Deep Reinforcement Learning Nanodegree Program").
Topics: reinforcement-learning openai-gym gym monte-carlo-methods reinforcement-learning-excercises action-value-function blackjack-env state-va mc-prediciton mc-control
Updated Apr 16, 2019. Jupyter Notebook.