Skip to content

Latest commit

 

History

History
32 lines (20 loc) · 1.3 KB

README.md

File metadata and controls

32 lines (20 loc) · 1.3 KB

Implementaion RL Algorithm with Pytorch

Policy Based

1. Reinforce : Done

2-1 PPO with Continuous action space :Done

2-2 PPO with Atari Environment : Done

3. Deep Deterministic Policy Gradient : Pendulum Env Done

Value Based

1. Deep Q-Learning : Done

2. Dobule DQN : Done

3. Dueling DQN : Done

4. C51 : need fix

Sampling Method

Sparse Reward Env