Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning

This repo contains the official implementation of Metric Residual Networks (MRN), along with other neural architectures for the critic network in goal-conditioned reinforcement learning (GCRL).

12 GCRL environments

Implemented Critic Networks

Critic Architecture
Monolithic Network
Deep/Wide Norms (DN/WN)
Bilinear Value Network (BVN)
Poisson Quasimetric Embedding (PQE)
Metric Residual Network (MRN)

Update 2022/12/6: Thanks to @SsnL for pointing out the bug: the metric part should be an l2-norm rather than a squared loss. (See this paper)
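One likely reason the distinction matters is that a metric must satisfy the triangle inequality, which the l2-norm does and the squared distance does not. The snippet below is a minimal, self-contained illustration of that point in plain Python. It is not the code in src/model.py, and the function names are hypothetical.

```python
import math

def l2_distance(u, v):
    """Euclidean (l2-norm) distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def squared_distance(u, v):
    """Squared Euclidean distance; this is not a metric."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def satisfies_triangle(dist, x, y, z):
    """Check d(x, z) <= d(x, y) + d(y, z), with a small float tolerance."""
    return dist(x, z) <= dist(x, y) + dist(y, z) + 1e-12

# Three collinear points on the real line.
x, y, z = [0.0], [1.0], [2.0]
print(satisfies_triangle(l2_distance, x, y, z))       # True: 2 <= 1 + 1
print(satisfies_triangle(squared_distance, x, y, z))  # False: 4 > 1 + 1
```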

1. Dependencies

Create a conda environment.

conda create -n metric-residual-network python=3.7.4
conda activate metric-residual-network

Install PyTorch

pip install torch==1.10.1+cu111 torchvision==0.11.2+cu111 torchaudio==0.10.1 -f https://download.pytorch.org/whl/torch_stable.html

Download mujoco200, then install the pip requirements:

pip install -r requirements.txt
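After installation, a quick check along the following lines can confirm that the active interpreter sees the expected packages. This is only a sketch: the module names listed (torch, mujoco_py, gym) are assumptions, since requirements.txt is the authoritative list, and the helper name is hypothetical.

```python
import importlib.util
import sys

# Assumed module names; adjust them to match requirements.txt.
ASSUMED_MODULES = ["torch", "mujoco_py", "gym"]

def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]

print("Python version:", ".".join(str(part) for part in sys.version_info[:3]))
print("Modules not found:", missing_modules(ASSUMED_MODULES))
```

An empty "Modules not found" list suggests the environment is ready. The check only locates the modules and does not import them, so it will not catch a broken MuJoCo or CUDA setup.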

2. Code structure

The code structure is listed below. Note that we also provide PyTorch implementations of the following methods for the convenience of future research, though they are not used in our paper:

Method	Paper
Model-based Hindsight Experience Replay	MHER
Goal-Conditioned Supervised Learning	GCSL
Weighted Goal-Conditioned Supervised Learning	WGCSL

metric-residual-network
 └─run_all.sh (the script to reproduce all results using different critics)
 └─run.sh     (the script to run with a specific critic architecture)
 └─main.py    (the main file to run all code)
 └─plot.py    (plotting utils to make figures in the paper)
 └─src
    └─model.py (contains the different critic architectures and the actor architecture)
    └─agent
       └─base.py  (base class for goal-conditioned agent)
       └─her.py   (the Hindsight Experience Replay agent)
       └─ddpg.py  (DDPG agent)
       └─mher.py  (M-HER agent)
       └─gcsl.py  (GCSL agent)
       └─wgcsl.py (WGCSL agent)

3. To reproduce results in the paper

./run_all.sh

4. Logs and checkpoints of trained models

For the convenience of future research, you can download all pretrained checkpoints and logs from here, unzip them, and place them under this repo. You can then use plot.py to reproduce the two main figures showing results on the 12 Fetch environments.

5. Citations

If you find our work interesting or the repo useful, please consider citing this paper:

@article{liu2022metric,
  title={Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning},
  author={Liu, Bo and Feng, Yihao and Liu, Qiang and Stone, Peter},
  journal={arXiv preprint arXiv:2208.08133},
  year={2022}
}
