Repository is under construction
The algorithm learns to play the traditional Snake game using pixel data from the game environment (the environment is included in this repository). Food appears at random positions.
As one can see, the model performs well in the environment, reaching almost the full possible score. The model also learns to avoid the different traps the game presents.
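To make the setup concrete, here is a rough sketch of what a pixel-style Snake environment could look like. This is not the code in this repository: the class name `SnakeEnv`, the grid size, the cell values, and the reward values (+1 for food, -1 for dying) are all assumptions chosen for illustration.

```python
import random


class SnakeEnv:
    """Hypothetical minimal Snake environment (illustrative, not the repo's code)."""

    # Actions: 0 = up, 1 = right, 2 = down, 3 = left
    MOVES = [(-1, 0), (0, 1), (1, 0), (0, -1)]

    def __init__(self, size=8, seed=None):
        self.size = size
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        mid = self.size // 2
        self.snake = [(mid, mid)]  # head is the first element
        self.done = False
        self._place_food()
        return self.observe()

    def _place_food(self):
        # Food appears on a random cell not occupied by the snake.
        free = [(r, c) for r in range(self.size) for c in range(self.size)
                if (r, c) not in self.snake]
        self.food = self.rng.choice(free) if free else None

    def observe(self):
        # "Pixel" observation: 0 = empty, 1 = body, 2 = head, 3 = food.
        grid = [[0] * self.size for _ in range(self.size)]
        for r, c in self.snake:
            grid[r][c] = 1
        hr, hc = self.snake[0]
        grid[hr][hc] = 2
        if self.food is not None:
            fr, fc = self.food
            grid[fr][fc] = 3
        return grid

    def step(self, action):
        dr, dc = self.MOVES[action]
        hr, hc = self.snake[0]
        new_head = (hr + dr, hc + dc)
        r, c = new_head
        hit_wall = not (0 <= r < self.size and 0 <= c < self.size)
        eats = new_head == self.food
        # The tail cell frees up this step unless the snake grows.
        body = self.snake if eats else self.snake[:-1]
        if hit_wall or new_head in body:
            self.done = True
            return self.observe(), -1.0, True
        self.snake.insert(0, new_head)
        if eats:
            reward = 1.0
            self._place_food()
            if self.food is None:  # board is full, nothing left to eat
                self.done = True
        else:
            self.snake.pop()
            reward = 0.0
        return self.observe(), reward, self.done


if __name__ == "__main__":
    # Random-policy rollout, just to show the interaction loop.
    env = SnakeEnv(size=8, seed=0)
    obs, total, done = env.reset(), 0.0, False
    while not done:
        obs, reward, done = env.step(random.randrange(4))
        total += reward
    print("episode return:", total)
```

A learning agent would replace the random action choice with a policy that takes the grid returned by `observe()` as input.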
- David Silver Lecture
- Sutton and Barto, Reinforcement Learning: An Introduction
- Actor-Critic Methods: A3C and A2C
- Asynchronous Methods for Deep Reinforcement Learning
- Continuous control with deep reinforcement learning
- Playing Atari with Deep Reinforcement Learning
The repository is still being built. I am currently trying Actor-Critic models; the results will be uploaded soon.