For this project, I’ve trained an agent to navigate and collect bananas in a square world built with the Unity environment. Below you can find the two conditions: the untrained agent and the trained agent.
Each time the agent collects a yellow banana, it is given a reward of +1. For each blue banana, it receives a reward of -1. The goal of the agent is to collect as many yellow bananas as possible while avoiding blue bananas, thereby maximizing the total reward received.
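The scoring rule above can be sketched as a small helper (`episode_score` is a hypothetical name, not part of the project code):

```python
# Hypothetical per-episode score: +1 per yellow banana, -1 per blue banana
def episode_score(yellow_collected: int, blue_collected: int) -> int:
    return yellow_collected * 1 + blue_collected * (-1)

# e.g. 15 yellow and 2 blue bananas give a score of 13
print(episode_score(15, 2))  # → 13
```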
The state space has 37 dimensions and contains the agent's velocity, along with ray-based perception of objects around the agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:
- `0` - move forward.
- `1` - move backward.
- `2` - turn left.
- `3` - turn right.
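An untrained agent simply samples uniformly from this action space. A minimal sketch (the `ACTIONS` mapping and `random_action` helper are illustrative names, not part of the project code):

```python
import random

# Discrete action space of the Banana environment, as listed above
ACTIONS = {
    0: "move forward",
    1: "move backward",
    2: "turn left",
    3: "turn right",
}

def random_action() -> int:
    """Uniformly sample one of the four discrete actions (untrained behaviour)."""
    return random.randrange(len(ACTIONS))

a = random_action()
print(a, ACTIONS[a])
```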
To complete the task, the agent must achieve an average score of +13 over 100 consecutive episodes.
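The solved condition can be checked with a moving average over the last 100 episode scores; a sketch (the `is_solved` helper is illustrative, not the project's actual training-loop code):

```python
def is_solved(scores, window=100, target=13.0):
    """Return True once the mean of the last `window` episode scores reaches `target`."""
    if len(scores) < window:
        return False
    recent = list(scores)[-window:]
    return sum(recent) / window >= target

# Example: 100 consecutive episodes averaging 14 would count as solved
print(is_solved([14.0] * 100))  # → True
```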
- Clone this repo.
- Copy the content of the `p1_navigation/` folder from this repo to the `p1_navigation/` folder of the udacity/deep-reinforcement-learning repo, replacing or removing existing files.
- Unzip the Banana_Linux.zip file that is located under the `p1_navigation/` folder, under the same directory. If you are not using Linux, follow the instructions at the bottom of this file.
Open a Jupyter notebook server and open Navigation.ipynb to train or test the agent.
- For training from scratch, run all the cells inside the navigation notebook.
- For testing, skip the training section and follow the instructions to load the weights.
You need to select the environment that matches your operating system:
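A small sketch of picking the environment file by operating system; the file names below are assumptions based on the usual Udacity build names, so check them against the project instructions:

```python
import platform

# Assumed environment file names per OS (verify against the project's download links)
ENV_PATHS = {
    "Linux": "p1_navigation/Banana_Linux/Banana.x86_64",
    "Darwin": "p1_navigation/Banana.app",
    "Windows": "p1_navigation/Banana_Windows_x86_64/Banana.exe",
}

def env_path() -> str:
    """Return the environment binary path for the current operating system."""
    system = platform.system()
    if system not in ENV_PATHS:
        raise RuntimeError(f"Unsupported OS: {system}")
    return ENV_PATHS[system]
```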