next gen(eralization) RL (building on wouterkool/attention-learn-to-route)

This project extends the work of Kool et al. and was forked from their repository (wouterkool/attention-learn-to-route). Its main goal is to compare additional reinforcement learning algorithms on the Travelling Salesman Problem (TSP) and the Orienteering Problem (OP) and to analyze their generalization capabilities with regard to problem sizes not seen during training.

directory overview

args/ contains all configuration arguments of started experiments
custom_classes/ contains custom tianshou classes
eval_logs/ contains optionally saved logs of evaluation runs of trained policies
figure_metas/ contains metadata for saved figures for easy adjustments to existing figures
figures/ contains created figures
nets/ contains torch modules, e.g. for the attention model and value estimators
problems/ contains code for the TSP and OP environments
run_configs/ contains CSV files exported from experiment_configurations.numbers with specific run configurations
utils/ contains utility code
log_dir/ contains training and evaluation logs and results
policy_dir/ contains trained tianshou policies that can be used for evaluation runs

dependency management

Dependencies are managed using pip-tools. Add new dependencies to requirements.in and run pip-compile to update requirements.txt.

remote execution tips


tmux command examples for remote server

tmux new -s Kenneth
tmux attach

Ctrl+B [ # enter scroll (copy) mode, q to quit
Ctrl+B d # detach from the session
Ctrl+B w # window overview
Ctrl+B c # create a new window
Ctrl+D   # exit the current shell, which closes its window

copy data between remote server and local repo quickly using rsync

rsync -avP . <USER>@<SERVER>:<FULL_PATH>/attention-next-gen-rl --delete --exclude-from rsync_excludes.txt
rsync -avP <USER>@<SERVER>:<FULL_PATH>/attention-next-gen-rl/log_dir .

set up new experiments

All experiment configurations are set up in experiment_configurations.numbers. Export the tables to CSV and save them in the run_configs/ directory. After copying this directory to the remote server, the experiments can be run.
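
To illustrate how one exported row could be turned into a run configuration, the following minimal Python sketch reads a single row from a CSV file in run_configs/. The function name load_run_config and the zero-based row indexing are assumptions made for this example; the actual parsing behind --args_from_csv and --csv_row in run.py may differ.

```python
import csv
from pathlib import Path


def load_run_config(csv_path, row_id):
    """Return data row `row_id` of a CSV file as a dict.

    Assumption: rows are counted from 0 and the header line is not counted.
    """
    with Path(csv_path).open(newline="") as handle:
        rows = list(csv.DictReader(handle))
    if not 0 <= row_id < len(rows):
        raise IndexError(f"row {row_id} not found; file has {len(rows)} data rows")
    return rows[row_id]
```

Calling load_run_config("run_configs/<file>.csv", 0) would return the first data row as a dictionary that maps column names to string values, so numeric settings still need to be converted before use.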

run command examples

Make sure all necessary folders (such as policy_dir) exist on the remote server.

bash scheduler.sh run_configs/<file>.csv <x_first_experiments>
python3 run.py --args_from_csv run_configs/<file>.csv --csv_row <row_id> --gpu_id 0

The best policies are saved to the policy_dir directory.
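
Because runs expect folders such as policy_dir to exist, a small helper like the one below could create them before the first experiment is started. This is only a sketch: the function name ensure_dirs is hypothetical, and the folder list is an assumption based on the directory overview, not a list taken from run.py.

```python
from pathlib import Path

# Assumed folder names, taken from the directory overview; adjust as needed.
REQUIRED_DIRS = ("policy_dir", "log_dir", "eval_logs")


def ensure_dirs(root=".", names=REQUIRED_DIRS):
    """Create each folder under `root` if it is missing and return the paths."""
    paths = [Path(root) / name for name in names]
    for path in paths:
        # exist_ok=True makes repeated calls harmless.
        path.mkdir(parents=True, exist_ok=True)
    return paths
```

Running ensure_dirs() from the repository root on the remote server leaves existing folders and their contents untouched.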

eval command examples

python3 run.py --saved_policy_path policy_dir/run_127__20230823T094935.pth --gpu_id 0

preview log data using tensorboard

tensorboard --logdir log_dir/ --reload_multifile TRUE

preparing log data for visualizations/plots

git clone https://github.com/Kenneth-Schroeder/tensorboard-aggregator
cd tensorboard-aggregator
python3 aggregator.py --path ../attention-next-gen-rl/log_dir/trainings

visualize/plot log data using code in plotting.ipynb

Plots are saved in figures/, and the corresponding metadata for each plot is saved in figure_metas/.
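
The idea of keeping metadata next to each figure can be sketched as follows. The JSON layout and the helper names save_figure_meta and load_figure_meta are hypothetical; the format that plotting.ipynb actually writes to figure_metas/ is not described here and may differ.

```python
import json
from pathlib import Path


def save_figure_meta(name, meta, meta_dir="figure_metas"):
    """Write `meta` (a JSON-serializable dict) to <meta_dir>/<name>.json."""
    target = Path(meta_dir) / f"{name}.json"
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(json.dumps(meta, indent=2, sort_keys=True))
    return target


def load_figure_meta(name, meta_dir="figure_metas"):
    """Read the metadata written by save_figure_meta back into a dict."""
    return json.loads((Path(meta_dir) / f"{name}.json").read_text())
```

Storing settings such as the title or axis labels this way would allow an existing figure to be regenerated with small adjustments without repeating the full plotting setup.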

collecting comparison data for Kool et al.'s version

Please use the branch master_bench for collecting comparison data for Kool et al.'s version.
