A Toolkit for Distributional Control of Generative Models
machine-learning ai alignment language-models monte-carlo-sampling generative-models fine-tuning human-preferences distributional-policy-gradients
-
Updated
Sep 4, 2023 - Python