This is the repository for the reinforcement learning catch example. In this project, we use RL to train an agent that controls a basket to catch a piece of falling fruit. The environment is defined entirely using matplotlib.
Click here to read more about the theory behind the RL agent and the learning process.
This project requires:
- Python 3.9.6
- pyenv
- poetry
For Unix users, we recommend the use of `pyenv` to manage the Python version as specified in `.python-version`. See below for instructions on setting up `poetry` and `pyenv` on your system.
Unix:
- Install poetry: `pip install poetry`
- Install pyenv
Windows:
- Install poetry: `pip install poetry`
- Pyenv does not officially support Windows, so instead ensure you have the correct Python version (3.9.6) installed.
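  A quick way to confirm this is to check the interpreter version from a terminal; it should report 3.9.6:

  ```bash
  python --version
  ```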
- For first-time users, in the root directory run:
  - `poetry install` to install all dependencies
  - `pyenv install` to install and use the correct version of Python (for macOS users)
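  Putting the first-time steps together on macOS/Linux, the sequence looks roughly like this:

  ```bash
  pip install poetry   # install poetry
  pyenv install        # install the Python version pinned in .python-version
  poetry install       # install the project dependencies
  ```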
- To run the training script locally, run `poetry run python src/train.py`
- Note: if you just want to see the code in action, reducing the number-of-epochs parameter in `train.py` will shorten the training time, letting you run the code quickly.
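  Alternatively, assuming the training script also reads the `TRAIN_EPOCHS` environment variable described below when run outside Docker, you can lower the epoch count for a single invocation (the value here is purely illustrative):

  ```bash
  TRAIN_EPOCHS=10 poetry run python src/train.py
  ```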
- To run the inference script locally:
  - Make sure you have a model file stored as `model/model.h5`
  - Run `poetry run python src/run.py`
To run the docker image:
- Make sure you have a Docker daemon running (e.g. using Docker Desktop)
- Run `docker build -t catch .` to build the Dockerfile into an image with the tag `catch`
- Run `docker run catch` to run the training image in a container (add the `--rm` flag to delete the container after it has run).
- To obtain the gif resulting from the model running, use `docker run -v $(pwd):/rl-catch-example/gif catch`.
- In addition, `-e` can be used to specify run-time environment variables (see the example after this list).
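For example, a complete build-and-run that passes training parameters and collects the gif might look like the following (the `-e` values shown are purely illustrative, not the project defaults):

```bash
docker build -t catch .

# --rm removes the container when it exits, -v mounts the current directory
# so the gif is written to the host, and -e overrides training parameters.
docker run --rm \
  -v $(pwd):/rl-catch-example/gif \
  -e TRAIN_EPOCHS=500 \
  -e TRAIN_GRID_SIZE=10 \
  catch
```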
To explore and adjust the model training parameters, you can set the following environment variables:
- `TRAIN_EPOCHS`
- `TRAIN_EPSILON`
- `TRAIN_MAX_MEMORY`
- `TRAIN_HIDDEN_SIZE`
- `TRAIN_HIDDEN_LAYERS`
- `TRAIN_BATCH_SIZE`
- `TRAIN_GRID_SIZE`
In addition, if you want to warm-start the model, set `TRAIN_WARM_START_PATH` to a previous model's weights file, for example `./model/model.h5`.
Model run environment variables:
- `RUN_GAME_ITERATIONS`
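As a quick illustration (the values are placeholders rather than project defaults, and it is assumed that `RUN_GAME_ITERATIONS` controls how many games `run.py` plays), these variables can be set inline for a single run:

```bash
# Warm-start training from an existing weights file and override the epoch count
TRAIN_WARM_START_PATH=./model/model.h5 TRAIN_EPOCHS=500 poetry run python src/train.py

# Run inference for a chosen number of games
RUN_GAME_ITERATIONS=5 poetry run python src/run.py
```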
Direnv is a great tool for defining environment variables at a folder level. To set it up, install the tool and create a `.envrc` file containing the environment variables you want to define. `.envrc.sample` is a sample direnv config file which can be copied.
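A minimal `.envrc` might look like this (the values are illustrative; use `.envrc.sample` as the actual starting point):

```bash
# Illustrative values only -- copy .envrc.sample for the project's template
export TRAIN_EPOCHS=500
export TRAIN_GRID_SIZE=10
export RUN_GAME_ITERATIONS=5
```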
To initiate direnv on the repo, run `direnv allow`.
If you would like to develop on this repo, ensure that the pre-commit hooks run on your local machine. To enable this, run `pip install pre-commit` followed by `pre-commit install`.
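Once installed, the hooks run automatically on each commit. They can also be run manually against the whole repository using pre-commit's standard invocation (not specific to this project):

```bash
pre-commit run --all-files
```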