Gazebo Gym Environment for learning force control with peg insertion tasks (Universal Robots)

This repository is based on our public repositories ur3 and ur_openai_gym.

Installation

Tested on ROS Melodic - Ubuntu 18.04

Recommended installation using Docker: follow wiki instructions

Usage: Peg-In-Hole tasks

Training

On one terminal, launch a gazebo environment for an specific peg:

roslaunch ur3_gazebo ur_peg.launch peg_shape:=cuboid

valid shapes [cylinder, hexagon, cuboid, triangular, trapezoid, star]

On a different terminal, execute the training script:

rosrun ur3e_rl tf2rl_sac.py -e0

The training should start:

The robot goes to a given initial position,
a taskboard spawned randomly within a defined workspace,
the robot moves towards the goal pose, defined with respect to the current pose of the taskboard
the training lasts 100,000 time steps.

There are 3 sample environments defined in the training script tf2rl_sac.py

For a training environment with a single peg, which does not change during the entire training session.
For a training environment with multiple pegs, the entire robot is reset when the peg is changed. **Note** in this case the simulation tends to be unstable without a powerful CPU/GPU. The `real time factor` start degrading with time until it barely moves.
A sample configuration for training/testing with a real robot.

The hyper-parameters of each environment is defined in a YAML config file, which can be found on the packageur3e_rl/config

After a training session is started, training data will be store in a folder results/[datetime]_**dir_suffix** under the current directory. More information can be found in the RL repository tf2rl

Testing

A learned policy can be executed using the helper script evaluate_policy.py:

rosrun ur3e_rl evaluate_policy.py -n 50 results/[datetime]_**dir_suffix**

Useful parameters of this script

--training: actions are taken similarly as during training, with a randomness to it, i.e. a sample from a Gaussian distribution. Without this, the actions are taken as the mean of the Gaussian distribution, a lot more stable behavior.
-n: number of test to execute
-s: store/record observations from tests.

References

If you find this code useful, consider citing our work:

@article{
  author = {Beltran-Hernandez, Cristian C. and Petit, Damien and Ramirez-Alpizar, Ixchel G. and Harada, Kensuke},
  title = {Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study},
  doi = {10.48550/ARXIV.2204.12844},
  url = {https://arxiv.org/abs/2204.12844},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution Non Commercial No Derivatives 4.0 International}
}

ArXiv: https://arxiv.org/abs/2204.12844

Supplemental Video: https://youtu.be/_FVQC5OcGjs

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docker		docker
ur3_description		ur3_description
ur3_gazebo		ur3_gazebo
ur3e_openai		ur3e_openai
ur3e_rl		ur3e_rl
ur_control		ur_control
ur_pykdl		ur_pykdl
BUILD-DOCKER-IMAGE.sh		BUILD-DOCKER-IMAGE.sh
INCL-SUDO-ENV.sh		INCL-SUDO-ENV.sh
INCL-USER-ENV.sh		INCL-USER-ENV.sh
README.md		README.md
RUN-DOCKER.sh		RUN-DOCKER.sh
SETUP-DEVEL-MACHINE.sh		SETUP-DEVEL-MACHINE.sh
dependencies.rosinstall		dependencies.rosinstall
docker-workspace.code-workspace		docker-workspace.code-workspace

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gazebo Gym Environment for learning force control with peg insertion tasks (Universal Robots)

Installation

Usage: Peg-In-Hole tasks

Training

Testing

References

About

Releases

Packages

Languages

cambel/robot-learning-cl-dr

Folders and files

Latest commit

History

Repository files navigation

Gazebo Gym Environment for learning force control with peg insertion tasks (Universal Robots)

Installation

Usage: Peg-In-Hole tasks

Training

Testing

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages