01. Epsilon Greedy
02. Optimistic Initial Values
03. UCB1
04. Bayesian Bandit Thompson Sampling
05. Iterative Policy Evaluation
06. Policy Iteration
07. Value Iteration
08. TD(0)
09. TD(λ)
10. SARSA
11. SARSA(λ)
12. Q-Learning
13. Deep Q-Learning