Skip to content

Latest commit

 

History

History
35 lines (19 loc) · 2.22 KB

README.md

File metadata and controls

35 lines (19 loc) · 2.22 KB

OpenAI Scholars: Reinforcement Learning Self-Study

Week 1: Markov Decision Processes

Resources

  1. Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition Jan 1 2018 Draft Chapter 3: Markov Decision Processes and Chapter 4: Dynamic Programming

  2. Deep RL Bootcamp Core Lecture 1 Intro to MDPs and Exact Solution Methods -- Pieter Abbeel Video | Slides

  3. Deep RL Bootcamp Core Lecture 2 Sample-based Approximations and Fitted Learning -- Rocky Duan Video | Slides

  4. Deep RL Bootcamp Lab 1: Markov Decision Processes You will implement value iteration, policy iteration, and tabular Q-learning and apply these algorithms to simple environments including tabular maze navigation (FrozenLake) and controlling a simple crawler robot.

  5. CS294 Reinforcement learning introduction -- Levine Video | Slides

  6. CS294 Value functions introduction -- Levine Video | Slides

  7. Introduction to Reinforcement Learning by Joshua Achiam, OpenAI Slides

Notes

RL Algorithms Diagram alt text

Interaction in Markov decision process

alt text

Value Iteration in an MDP

alt text

OpenAI’s Crawler robot attempting to walk with random actions. video

The same Crawler robot after it has been trained for 30,000 steps with a Q-learning algorithm. video