Skip to content

Latest commit

 

History

History
30 lines (24 loc) · 1.58 KB

README.md

File metadata and controls

30 lines (24 loc) · 1.58 KB

Reinforcement Learning

All the algorithms I implemented (using Python3 and NumPy) while reading Introduction to Reinforcement Learning by Sutton and Barto.
There's a separate ReadMe for each topic

High Level structure of the repo :

  1. Bandits
    • Epsilon-greedy
    • Optimistic initial value
    • Softmax exploration
  2. Dynamic Programming methods
    • Policy iteration
    • Value iteration
  3. Model free methods

Resources

CS234 and David silver often use different notations, it would be better to follow just one of them in the beginning (I prefer David Silver's lectures)

  1. Introduction to Reinforcement Learning by Sutton and Barto
  2. David Silver's lectures
  3. Stanford CS234

Check this out for more resources!