🎯
Focusing
PhD in Reinforcement Learning, LLM Alignment, RLHF
-
University of Cambridge
- https://holarissun.github.io/
- @HolarisSun
Pinned Loading
-
Prompt-OIRL
Prompt-OIRL Publiccode for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning
-
-
RewardShifting
RewardShifting PublicCode for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
-
YangRui2015/AWGCSL
YangRui2015/AWGCSL PublicCode for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
-
PCHID_code
PCHID_code PublicCode for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
Jupyter Notebook 15
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.