- 4.1. Understanding the Monte Carlo Method
- 4.2. Prediction and Control Tasks
- 4.2.1. Prediction Task
- 4.2.2. Control Task
- 4.3. Monte Carlo Prediction
- 4.3.1. MC Prediction Algorithm
- 4.3.2. Types of MC prediction
- 4.3.3. First-visit Monte Carlo
- 4.3.4. Every visit Monte Carlo
- 4.4. Understanding the BlackJack Game
- 4.4.1. Blackjack Environment in the Gym
- 4.5. Every-visit MC Prediction with Blackjack Game
- 4.6. First-visit MC Prediction with Blackjack Game
- 4.7. Incremental Mean Updates
- 4.8. MC Prediction (Q Function)
- 4.9. Monte Carlo Control
- 4.10. On-Policy Monte Carlo Control
- 4.11. Monte Carlo Exploring Starts
- 4.12. Monte Carlo with Epsilon-Greedy Policy
- 4.7.5. Algorithm MC Control with Epsilon-Greedy Policy
- 4.13. Implementing On-Policy MC Control
- 4.14. Off-Policy Monte Carlo Control
- 4.15. Is MC Method Applicable to all Tasks?
04. Monte Carlo Methods
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||