#

srpo

Here is 1 public repository matching this topic...

thu-ml / SRPO

Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).

reinforcement-learning offline rl generative diffusion score-based-models d4rl srpo behavior-regularization

Updated Feb 10, 2024
Python

Improve this page

Add a description, image, and links to the srpo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the srpo topic, visit your repo's landing page and select "manage topics."