I used the A3C (Asynchronous Advantage Actor-Critic) algorithm to train a deep reinforcement learning agent to solve the Kungfu Gym environment.
Updated Feb 7, 2024 - Python
I developed and trained a deep convolutional Q-learning model to enable an agent to successfully solve the Pacman gym environment.