sharon12312/reinforcement-learning-final-project

IDC – Reinforcement Learning - Final Project - 2022


Getting Started

The main goal of this final project is to summarize the main topics discussed in the course through both practice and theory, with an emphasis on the second part of the course (Deep RL).
In this project you will solve several variations of the Highway env.

Highway is a collection of environments for autonomous driving and tactical decision-making tasks. Our goal is to help the user car overtake the bot cars on its own in a roadway environment.
We train the user car with deep reinforcement learning. The reward function penalizes the user car every time it slows down, every time it crashes into a bot car, and whenever there are bot cars in front of it. In the following environments we use the raw pixels as our state space, which allows us to train convolutional neural networks (CNNs).
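The three penalties described above can be illustrated with a small sketch. This is not the project's actual reward function: the `StepInfo` fields and the penalty weights are hypothetical names and values chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class StepInfo:
    """Hypothetical per-step summary of the user car's situation."""
    speed: float           # current speed of the user car
    previous_speed: float  # speed of the user car at the previous step
    crashed: bool          # True if the user car hit a bot car
    cars_ahead: int        # number of bot cars in front of the user car


# Illustrative penalty weights (assumptions, not the project's values).
SLOWDOWN_PENALTY = 0.1
CRASH_PENALTY = 1.0
CAR_AHEAD_PENALTY = 0.05


def shaped_reward(info: StepInfo) -> float:
    """Return a non-positive reward built from the three penalties."""
    reward = 0.0
    if info.speed < info.previous_speed:  # the car slowed down
        reward -= SLOWDOWN_PENALTY
    if info.crashed:                      # the car crashed into a bot car
        reward -= CRASH_PENALTY
    if info.cars_ahead > 0:               # at least one bot car is still ahead
        reward -= CAR_AHEAD_PENALTY
    return reward
```

With these assumed weights, a step with no slowdown, no crash, and no cars ahead yields a reward of zero, so the agent is pushed toward keeping its speed and overtaking safely.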


Instructions

Upload Weights

  • Open the submit.txt file and enter the link for the weights folder.
    • We used Google Drive to share the weights since we cannot upload them into the submission box.
  • Download the given weights folder from the shared link and extract it into your Google Drive.
  • Once the folder is set, run the code in the "Import Weights" section of the Google Colab notebook. This code loads the weights folder into your Colab hosted runtime.
  • Grant permissions by clicking on the "Connect to Google Drive" button to allow the notebook to access the weights folder.
  • The folder should contain:
├── ex1_w
│   ├── dqn_weights_easy.h5
│   ├── ddqn_weights_easy.h5
│   ├── a3c_icm_weights.h5
│   └── icm_a3c_icm_weights.h5
├── ex2_w
│   ├── dqn_weights_medium.h5
│   └── ddqn_weights_medium.h5
└── ex3_w
    ├── dqn_weights_medium.h5
    └── ddqn_weights_medium.h5
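
Before running the notebook, it may help to confirm that the extracted folder matches the layout above. The following is a minimal sketch of such a check; the helper name `missing_weight_files` is hypothetical and is not part of the notebook.

```python
from pathlib import Path

# Expected layout, copied from the folder tree above.
EXPECTED_WEIGHTS = {
    "ex1_w": [
        "dqn_weights_easy.h5",
        "ddqn_weights_easy.h5",
        "a3c_icm_weights.h5",
        "icm_a3c_icm_weights.h5",
    ],
    "ex2_w": ["dqn_weights_medium.h5", "ddqn_weights_medium.h5"],
    "ex3_w": ["dqn_weights_medium.h5", "ddqn_weights_medium.h5"],
}


def missing_weight_files(weights_root):
    """Return the relative paths of expected weight files that are absent."""
    root = Path(weights_root)
    return [
        f"{folder}/{name}"
        for folder, names in EXPECTED_WEIGHTS.items()
        for name in names
        if not (root / folder / name).is_file()
    ]
```

Calling `missing_weight_files(...)` with the path of the extracted weights folder returns an empty list when every file is in place. The path itself depends on where you extracted the folder in your Drive.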

Run the Agents Evaluations

To evaluate each agent and construct the environment's video, follow the instructions below:

  • Run the "Project Algorithms" section (running 25 cells).
    • As mentioned above, the "Import Weights" section will require you to grant access to your Google Drive in order to load the models' weights into your Colab runtime.
  • Run the "Agents Evaluation" section to evaluate each agent according to the exercise guidelines.
    • Environments Utils, for loading the relevant configuration and utils objects within the notebook.
    • Highway-Env Easy.
    • Highway-Env Medium.
    • Super Highway Agent.
  • In each section, run the code to construct the environment's video for each model (DQN, DDQN and A3C & ICM).
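
For readers who want a picture of what an evaluation run does, the sketch below shows a generic episode loop that averages the total reward over a few episodes. It is not the notebook's code: `ToyEnv` is a hypothetical stand-in for a highway environment, and the loop assumes the classic Gym-style `step` that returns four values (newer Gymnasium releases return five).

```python
class ToyEnv:
    """Hypothetical stand-in environment with a classic Gym-style API."""

    def __init__(self, episode_length=5):
        self.episode_length = episode_length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # the observation is just the step counter here

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0  # action 1 is rewarded
        done = self.t >= self.episode_length
        return self.t, reward, done, {}


def evaluate(env, policy, episodes=3):
    """Run `policy` for several episodes and return the mean episode return."""
    returns = []
    for _ in range(episodes):
        obs = env.reset()
        done, total = False, 0.0
        while not done:
            obs, reward, done, _ = env.step(policy(obs))
            total += reward
        returns.append(total)
    return sum(returns) / len(returns)


# A fixed policy that always picks action 1 in the toy environment.
mean_return = evaluate(ToyEnv(), lambda obs: 1)
```

In the notebook, the policy would instead pick the action preferred by the loaded DQN, DDQN, or A3C & ICM model, and the rendered frames would be collected into the video.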

Authors

  • Sharon Mordechai.
  • Amit Huli.