Skip to content

Latest commit

 

History

History
 
 

04. Monte Carlo Methods

4. Monte Carlo Methods

  • 4.1. Understanding the Monte Carlo Method
  • 4.2. Prediction and Control Tasks
    • 4.2.1. Prediction Task
    • 4.2.2. Control Task
  • 4.3. Monte Carlo Prediction
    • 4.3.1. MC Prediction Algorithm
    • 4.3.2. Types of MC prediction
    • 4.3.3. First-visit Monte Carlo
    • 4.3.4. Every visit Monte Carlo
  • 4.4. Understanding the BlackJack Game
    • 4.4.1. Blackjack Environment in the Gym
  • 4.5. Every-visit MC Prediction with Blackjack Game
  • 4.6. First-visit MC Prediction with Blackjack Game
  • 4.7. Incremental Mean Updates
  • 4.8. MC Prediction (Q Function)
  • 4.9. Monte Carlo Control
  • 4.10. On-Policy Monte Carlo Control
  • 4.11. Monte Carlo Exploring Starts
  • 4.12. Monte Carlo with Epsilon-Greedy Policy
    • 4.7.5. Algorithm MC Control with Epsilon-Greedy Policy
  • 4.13. Implementing On-Policy MC Control
  • 4.14. Off-Policy Monte Carlo Control
  • 4.15. Is MC Method Applicable to all Tasks?