Skip to content

Official Implementation of "Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice"

Notifications You must be signed in to change notification settings

matsuolab/Deep-Variance-Weighting-Maze

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Official Implementation of Deep Variance Weighting (DVW) [Experiments in Section 7.2.1]

This repository is the official implementation of Deep Variance Weighting for the Maze experiments in Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.

“”

“”

Requirements

Install dependencies by:

# make sure you are in Variance-Weighted-MDVI/Deep-Variance-Weighting-Maze
poetry install

You can test if everything works by:

poetry run python experiments/run_generative.py --weight_mode dvw --seed 0 --maze_seed 1 --maze_eps 0.1 --iteration 10000 --maze_size 25 --num_pitfall 8 --num_samples_target 3 --is_munchausen

Run Maze Experiments

Run bash experiments/start_job.bash

(Optional) experiments/results.zip contains the results of bash experiments/start_job.bash. Unzip the zip-file inside the experiments folder to skip experiments.

Plot results

Run poetry run python experiments/plot_all_results.py The figures will be saved in experiments/results/0.1-3/optimality-gap.png and experiments/results/0.1-10/optimality-gap.png.

About

Official Implementation of "Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published