Implementation of "Unsupervised Task Clustering for Multi-Task Reinforcement Learning", paper available here

We use sacred to configure and monitor our experiments.

The implementation is split into three main files:

train_em.py for EM, SP and PPT
train_multi_head.py for the Multi-Head experiments
train_atari_em.py for all Atari experiments

The implementation of the Atari experiments is closely based on the Dopamine framework.

Usage

To use our code, please install dependencies

pip install -r requirements.txt

and run our experiments with

python train_em.py

To chose a different set of tasks, use the sacred CLI logic, for example to replicate our experiments on the pendulum task set use:

python train_em.py with environment=pendulum

Logging is done via the sacred logging interface, which stores the results in results/sacred/$EXPERIMENT_ID/. They can then be processed for example with incense.

Potential Issues

If the source code is not running as expected, please try to install some system packages that gym might require for the atari games.

apt-get install -y libglu1-mesa-dev libgl1-mesa-dev libosmesa6-dev xvfb ffmpeg curl patchelf libglfw3 libglfw3-dev cmake zlib1g zlib1g-dev swig

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
agents		agents
common		common
custom_env		custom_env
dopamine		dopamine
Appendix_EM_Task_Clustering_for_Multitask_RL.pdf		Appendix_EM_Task_Clustering_for_Multitask_RL.pdf
README.md		README.md
requirements.txt		requirements.txt
train_atari_em.py		train_atari_em.py
train_em.py		train_em.py
train_multi_head.py		train_multi_head.py
training_snippets.py		training_snippets.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Implementation of "Unsupervised Task Clustering for Multi-Task Reinforcement Learning", paper available here

Usage

Potential Issues

About

Releases

Packages

Contributors 2

Languages

JohannesAck/EMTaskClustering

Folders and files

Latest commit

History

Repository files navigation

Implementation of "Unsupervised Task Clustering for Multi-Task Reinforcement Learning", paper available here

Usage

Potential Issues

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages