Denavit–Hartenberg (DH) parameters for behavior cloning

Denavit–Hartenberg (DH) parameters are a set of four parameters per joint that define how a reference frame is attached to each link in a kinematic chain. We use this differentiable formulation to improve transfer in behavior cloning between robots with the same or different kinematic configurations.
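As a rough illustration of what the four parameters encode, the sketch below builds the classic DH homogeneous transform and chains it along a planar two-link arm. It is not the code in dh_parameters.py; the function names (dh_transform, forward_kinematics, end_effector_xyz) and the link lengths are assumptions made for the example.

```python
import math


def dh_transform(theta, d, a, alpha):
    """Classic DH transform from frame i-1 to frame i.

    theta: rotation about z, d: offset along z,
    a: link length along x, alpha: twist about x.
    """
    ct, st = math.cos(theta), math.sin(theta)
    ca, sa = math.cos(alpha), math.sin(alpha)
    return [
        [ct, -st * ca, st * sa, a * ct],
        [st, ct * ca, -ct * sa, a * st],
        [0.0, sa, ca, d],
        [0.0, 0.0, 0.0, 1.0],
    ]


def matmul4(A, B):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
        for i in range(4)
    ]


def forward_kinematics(dh_rows):
    """Chain one DH transform per joint; returns the base-to-tip transform."""
    T = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    for theta, d, a, alpha in dh_rows:
        T = matmul4(T, dh_transform(theta, d, a, alpha))
    return T


def end_effector_xyz(T):
    """Read the translation column of a homogeneous transform."""
    return (T[0][3], T[1][3], T[2][3])


# Hypothetical planar two-link arm: both links have length 1.0 (illustrative).
arm = [
    (math.pi / 2, 0.0, 1.0, 0.0),  # joint 1 rotated 90 degrees
    (0.0, 0.0, 1.0, 0.0),          # joint 2 straight
]
print(end_effector_xyz(forward_kinematics(arm)))
```

Because every entry of the transform is a smooth function of the joint angle, the same chain can be written with an autodiff library so that gradients flow from the end-effector position back to the predicted joint values.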

train a new RL model:

python train_rl.py --cfg experiments/base_dm.cfg

evaluate and plot the latest model from a training directory:

python train_rl.py --eval --frames --load_model results/base_easy_01111_reacher_1_00/

This produces plots of reward and target position, and a video of the episodes, with file names derived from the most recent .pt file in the --load_model directory.

evaluate and plot a specific model:

python train_rl.py --eval --frames --load_model results/base_easy_01111_reacher_1_00/base_0000355000.pt

train a behavior cloning model which predicts relative JOINT_POSITIONS for a given state

with DH on the end effector or target link:

python train_bc.py --load_replay <path-to-rl-eval-replay-buffer>.pkl

on the joint angles:

python train_bc.py --load_replay <path-to-rl-eval-replay-buffer>.pkl --loss "angle"
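To make the difference between the two options concrete, here is a minimal sketch under stated assumptions. It is not the loss implemented in train_bc.py: the names angle_loss and dh_loss, the planar two-link arm, and the unit link lengths are all hypothetical. The angle loss compares joint values directly, while the DH-based loss first maps both sets of joint values through the kinematic chain and compares the resulting end-effector positions.

```python
import math


def dh_transform(theta, d, a, alpha):
    """Classic DH homogeneous transform for one joint."""
    ct, st = math.cos(theta), math.sin(theta)
    ca, sa = math.cos(alpha), math.sin(alpha)
    return [
        [ct, -st * ca, st * sa, a * ct],
        [st, ct * ca, -ct * sa, a * st],
        [0.0, sa, ca, d],
        [0.0, 0.0, 0.0, 1.0],
    ]


def matmul4(A, B):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
        for i in range(4)
    ]


def end_effector(thetas, link_lengths):
    """Planar chain: each joint is the DH row (theta, d=0, a=length, alpha=0)."""
    T = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    for theta, length in zip(thetas, link_lengths):
        T = matmul4(T, dh_transform(theta, 0.0, length, 0.0))
    return (T[0][3], T[1][3], T[2][3])


def angle_loss(pred_thetas, target_thetas):
    """Mean squared error taken directly on the joint angles."""
    n = len(pred_thetas)
    return sum((p - t) ** 2 for p, t in zip(pred_thetas, target_thetas)) / n


def dh_loss(pred_thetas, target_thetas, link_lengths):
    """Mean squared error on end-effector positions computed through the DH chain."""
    p = end_effector(pred_thetas, link_lengths)
    t = end_effector(target_thetas, link_lengths)
    return sum((a - b) ** 2 for a, b in zip(p, t)) / 3


# Two joint configurations that place the tip of the arm at the same point.
links = (1.0, 1.0)                    # illustrative link lengths
target = (0.0, math.pi / 2)
pred = (math.pi / 2, -math.pi / 2)
print("angle loss:", angle_loss(pred, target))
print("DH loss:", dh_loss(pred, target, links))
```

In this example the two configurations differ in joint space but reach the same point, so the angle loss is large while the DH-based loss is close to zero. A loss defined on the end effector also stays meaningful when the two robots have different link lengths, since each can be passed through its own chain.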

now plot and evaluate the behavior cloning model:

NOTE: This step is not really relevant with dm_control; it is used with robosuite and the JOINT_POSITION controller.

python train_bc.py --load_model <path-to-saved-bc-model> --eval --frames

Adding Robot Configurations to dm_control

We utilize some non-standard dm_control robots for experiments. To work with these, copy the contents of the "robots" directory to your dm_control installation:

cd DH
cp robots/dm_control/*.* ../dm_control/dm_control/suite/ 
cd ../dm_control 
pip install .

Then you can experiment with these models as you would other robots.

For instance, to train an RL reacher agent whose links are double the size of those in the standard reacher:

cd DH
python train_rl.py --cfg experiments/reacher_double.cfg
