Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion
This repository contains the code and resources for our paper, which addresses the stability-plasticity dilemma in adaptive train scheduling. In this work, we introduce different curricula for train scheduling and an algorithm called Continual Deep Q-Network Expansion (CDE) to improve agent adaptability in non-stationary environments. Our method dynamically adjusts Q-function subspaces and uses Elastic Weight Consolidation (EWC) and rational Padé activation functions to address catastrophic forgetting and the loss of network plasticity, achieving superior learning efficiency and generalization compared to traditional RL baselines. The curriculum design emphasizes skill building through adjacent tasks, helping agents retain learned knowledge while effectively acquiring new behaviors.
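For illustration only, the sketch below shows the two regularization components named above in generic PyTorch: a learnable rational (Padé) activation unit and the quadratic EWC penalty. This is not the repository's implementation; the polynomial degrees, the initialization, and the names PadeActivation and ewc_penalty are assumptions.

import torch
import torch.nn as nn

class PadeActivation(nn.Module):
    """Learnable rational activation f(x) = P(x) / (1 + |Q(x)|)."""
    def __init__(self, num_degree=5, den_degree=4):
        super().__init__()
        # Small random coefficients; the paper's initialization may differ.
        self.a = nn.Parameter(0.1 * torch.randn(num_degree + 1))  # numerator coefficients
        self.b = nn.Parameter(0.1 * torch.randn(den_degree))      # denominator coefficients

    def forward(self, x):
        numerator = sum(a_k * x ** k for k, a_k in enumerate(self.a))
        denominator = 1.0 + torch.abs(
            sum(b_k * x ** (k + 1) for k, b_k in enumerate(self.b))
        )
        return numerator / denominator

def ewc_penalty(model, fisher, anchor_params, ewc_lambda=0.5):
    """EWC term: (lambda / 2) * sum_i F_i * (theta_i - theta*_i)^2."""
    penalty = torch.zeros(())
    for name, param in model.named_parameters():
        if name in fisher:  # fisher: per-parameter Fisher information from a previous task
            penalty = penalty + (fisher[name] * (param - anchor_params[name]) ** 2).sum()
    return 0.5 * ewc_lambda * penalty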
- Code for the Continual DQN Expansion (CDE) algorithm
- Implementations of four different base learning curricula
- Code to evaluate the results
- Link to the paper
- The simulator used is Flatland-RL by AICrowd
- After cloning, you can create the environment with:
conda env create -f environment.yml
python -m pip install flatland-rl
python -m pip install torch
- Run CDE on the custom curriculum:
python multi_agent_training.py --curriculum="customPMD" --policy="CDE" --hidden_size=1024 --layer_count=2 --ewc_lambda=0.5
- Create plots of training and evaluation completions/scores:
python eval_training.py --file="score_***.csv" --type="score"
python eval_training.py --file="completions_****.csv" --type="completions"
- Create an animation of the PAU activation function:
python eval_weights.py --file="weights_***.csv" --network="0" --layer=0