Skip to content

Latest commit

 

History

History
10 lines (5 loc) · 438 Bytes

README.md

File metadata and controls

10 lines (5 loc) · 438 Bytes

Generative replay addition to TD3

PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3) with a generative replay component.

The code is heavily modified to work for my research needs

Method is tested on MuJoCo continuous control tasks in OpenAI gym. Networks are trained using PyTorch 1.7 and Python 3.8.