Course work of Reinforcement-Learning-CS6700
reinforcement-learning monte-carlo q-learning policy-gradient sarsa multi-armed-bandits function-approximation hierarchical-reinforcement-learning gym-environment open-ai-gym dqn-tensorflow puddle-world four-rooms
-
Updated
Jul 22, 2020 - Jupyter Notebook