LAMDA-RL

We are a group of reinforcement learning researchers from the LAMDA Group at Nanjing University.

LAMDA-RL Lab

LAMDA-RL Lab works on advancing reinforcement learning (RL) and applying it to build general decision-making intelligence.

We focus on developing novel algorithms and architectures that enable RL systems to learn and make decisions in increasingly general and adaptable ways. Some key areas we are exploring include:

  • Imitation learning;
  • Offline reinforcement learning;
  • Model-based RL and world model learning;
  • Multi-agent and collaborative RL;
  • Planning and learning with large models.

Through both fundamental and applied research, we aim to create RL-based systems that exhibit intelligent and general decision-making capabilities. For more information about our lab and research, see our website: https://lamda-rl.nju.edu.cn/.

Pinned

  1. OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 62 7

  2. ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 31 5

  3. PRDC Public

    Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 14 4

  4. ACT Public

    Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)

    Python 9 3

  5. Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 8

  6. CPR Public

    Forked from LyndonKong/CPR

    Python 1

Repositories

Showing 10 of 28 repositories
  • Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 8 0 0 0 Updated Sep 10, 2024
  • WiseRL Public Forked from typoverflow/WiseRL

    PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

    Python 1 MIT 1 0 0 Updated Sep 5, 2024
  • .github Public
    0 0 0 0 Updated Sep 4, 2024
  • madac Public Forked from lamda-bbo/madac

    Official implementation of the NeurIPS 2022 paper “Multi-agent Dynamic Algorithm Configuration”

    Python 0 Apache-2.0 7 0 0 Updated Sep 4, 2024
  • CPR Public Forked from LyndonKong/CPR
    Python 1 1 0 0 Updated Sep 4, 2024
  • unstable_baselines
    Python 0 12 0 0 Updated Sep 4, 2024
  • UtilsRL Public
    Python 0 MIT 0 0 0 Updated Sep 4, 2024
  • MATTAR Public Forked from chenf-ai/MATTAR

    Official code repository for "Multi-Agent Policy Transfer via Task Relationship Modeling".

    Python 0 1 0 0 Updated Aug 16, 2024
  • ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 31 Apache-2.0 5 1 0 Updated Jul 21, 2024
  • policy-conditioned-model Public Forked from xionghuichen/policy-conditioned-model

    Official code for "Effective Offline Environment Reconstruction when the Dataset is Collected from Diversified Behavior Policies"

    Python 1 Apache-2.0 1 0 0 Updated Jul 15, 2024
