DRLearner

DRLearner is an open-source deep reinforcement learning (DRL) library based on Agent57 (Badia et al., 2020). We recommend reading this documentation page to get the essence of DRLearner.

Installation

Creating environment

Conda

sudo apt-get update
sudo apt-get install libpython3.10 ffmpeg swig
conda create --name drlearner python=3.10
conda activate drlearner

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:lib:/usr/lib:/usr/local/lib:~/anaconda3/envs/drlearner/lib
export PYTHONPATH=$PYTHONPATH:$(pwd)
conda env config vars set LD_LIBRARY_PATH=$LD_LIBRARY_PATH:lib:/usr/lib:/usr/local/lib:~/anaconda3/envs/drlearner/lib
conda env config vars set PYTHONPATH=$PYTHONPATH:$(pwd)

Install packages

pip install --no-cache-dir -r requirements.txt
pip install git+https://github.com/ivannz/gymDiscoMaze.git@stable

Venv

sudo apt-get update
sudo apt-get install libpython3.10 swig ffmpeg -y
python3.10 -m venv venv
source venv/bin/activate

export PYTHONPATH=$PYTHONPATH:$(pwd)

Install packages

pip install --no-cache-dir -r requirements.txt
pip install git+https://github.com/ivannz/gymDiscoMaze.git@stable

Binary files for Atari games

sudo apt-get install unrar
wget http://www.atarimania.com/roms/Roms.rar
unrar e Roms.rar roms/
ale-import-roms roms/

Running DRLearner Agent

DRLearner comes with the following available environments:

Lunar Lander example

Training

python ./examples/run_lunar_lander.py --num_episodes 1000 --exp_path experiments/test_pong/ --exp_name my_first_experiment

Correct terminal output like this means that the training has been launched successfully:

To choose the directory in which the experiment is saved, set the exp_path flag. If a model already exists in exp_path, it is loaded and training resumes from it. To name the experiment in W&B, set the exp_name flag.

Observing Lunar Lander in action

To visualize any environment, pass an instance of StorageVideoObserver to the environment loop. The observer itself takes an instance of DRLearnerConfig. In the config you can define

observers = [IntrinsicRewardObserver(), DistillationCoefObserver(), StorageVideoObserver(config)]
loop = EnvironmentLoop(env, agent, logger=logger_env, observers=observers)
loop.run(FLAGS.num_episodes)
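
The snippet above passes several observers to the environment loop. As a rough illustration of what such an observer might look like, here is a minimal sketch. It assumes the loop calls `observe_first`, `observe`, and `get_metrics` on each observer, as Acme's `EnvLoopObserver` interface does; whether DRLearner's observers follow exactly this interface is an assumption, and `EpisodeLengthObserver` is a hypothetical name that is not part of the library.

```python
from typing import Any, Dict


class EpisodeLengthObserver:
    """Hypothetical observer that counts the steps taken in the current episode.

    Assumes an Acme-style observer interface; not part of DRLearner.
    """

    def __init__(self) -> None:
        self._steps = 0

    def observe_first(self, env: Any, timestep: Any) -> None:
        # Assumed to be called once at the start of each episode: reset the counter.
        self._steps = 0

    def observe(self, env: Any, timestep: Any, action: Any) -> None:
        # Assumed to be called after every environment step.
        self._steps += 1

    def get_metrics(self) -> Dict[str, float]:
        # Assumed to be read by the loop at the end of the episode and logged.
        return {"episode_length": float(self._steps)}
```

Under these assumptions, an instance could be appended to the `observers` list alongside the built-in observers.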

Training with checkpoints (Montezuma)

Training resumes from the point at which the previous run stopped. Montezuma is the most difficult game, so make sure you have enough computational power. The total number of actors is number_of_actors_per_mixture * num_mixtures. Running too many actors may break your setup; with a 16-core CPU we advise around 12 actors in total.

python ./examples/distrun_atari.py  --exp_path artifacts/montezuma_base --exp_name montezuma_training
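
The actor count described above can be checked before launching a distributed run. The sketch below is only an illustration: the helper names are hypothetical, and the 25% core reserve is an assumption extrapolated from the single "16 cores, around 12 actors" recommendation, not a rule taken from DRLearner.

```python
def total_actors(number_of_actors_per_mixture: int, num_mixtures: int) -> int:
    """Total number of actors, as defined in the text above."""
    return number_of_actors_per_mixture * num_mixtures


def fits_cpu_budget(actors: int, cpu_cores: int, reserve_fraction: float = 0.25) -> bool:
    """Return True if `actors` leaves roughly `reserve_fraction` of the cores free.

    The default reserve is an assumption: 16 cores * (1 - 0.25) = 12 actors.
    """
    max_actors = int(cpu_cores * (1.0 - reserve_fraction))
    return actors <= max_actors


if __name__ == "__main__":
    actors = total_actors(number_of_actors_per_mixture=3, num_mixtures=4)
    print(actors, fits_cpu_budget(actors, cpu_cores=16))
```

Here 3 actors per mixture and 4 mixtures give 12 actors, which is within the assumed budget for a 16-core machine.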

More examples of training synchronous and distributed agents in these environments can be found in examples/.

Documentation

Ongoing Support

Join the DRLearner Developers List.


System Requirements

Google Cloud Configuration (GCP)	Local Configuration (Local)
Tested on Ubuntu 20.04 with Python 3.7	Tested on Ubuntu 22.04 with Python 3.10
Hardware: NVIDIA Tesla, 500 GB drive	Hardware: 8-core i7

