Skip to content

Latest commit

 

History

History
49 lines (42 loc) · 1.69 KB

README.md

File metadata and controls

49 lines (42 loc) · 1.69 KB

online-dt

This repository contains the Pytorch implementation of Online Decision Transformer by Qinqing Zheng, Amy Zhang, and Aditya Grover.

If you use this code for your research, please cite us as:

@inproceedings{zheng2022online,
  title={Online decision transformer},
  author={Zheng, Qinqing and Zhang, Amy and Grover, Aditya},
  booktitle={international conference on machine learning},
  pages={27042--27059},
  year={2022},
  organization={PMLR}
}

Requirements

conda env create -f conda_env.yml
source activate odt

Tips

If you encounter the libstdc++.so.6: version 'GLIBCXX_3.4.xx' not found error, the following command might help:

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:<path-to-your-conda-env>/lib

I have also found that tensorboard wants protobuf version to be 3.20.x, and this helped

# you might need to uninstall dm-control
pip3 install --upgrade protobuf==3.20.0

Example

To train an ODT agent for hopper with the medium-v2 dataset:

python main.py

This will produce the exp folder, where all the outputs are going to be logged including tensorboard blobs. One can attach a tensorboard to monitor training by running:

tensorboard --logdir exp

License

The majority of online-dt is licensed under CC-BY-NC, however portions of the project are available under separate license terms:

  • D4RL dataset - Creative Commons Attribution 4.0 License (CC-BY)
  • D4RL code, transformers, Lamb - Apache 2.0 License
  • stable-baselines3, Gym, decision-transformer - MIT License