A reinforcement learning framework for the game of Nim.
reinforcement-learning q-learning dqn sarsa dynamic-programming policy-iteration value-iteration expected-sarsa monte-carlo-methods double-q-learning temporal-difference-learning double-sarsa double-expected-sarsa n-step-bootstrapping n-step-sarsa n-step-expected-sarsa off-policy-n-step-sarsa off-policy-n-step-expected-sarsa n-step-tree-backup
-
Updated
Nov 21, 2020 - C++