Snake-AI

This project uses deep reinforcement learning (DRL) to play the Snake game automatically. The core method is the discrete-action variant of PPO, which performs well in discrete action spaces just as it does in continuous ones. Training the snake agent takes about half an hour, after which it is ready to play.
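As a rough illustration of what PPO optimizes over a discrete action set, the sketch below computes the standard clipped surrogate term for a single sample. It is not taken from ppo.py: the function names, the plain-Python softmax, and the default `eps=0.2` are assumptions made for this example.

```python
import math


def softmax(logits):
    """Turn raw action scores into a categorical distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ppo_clip_term(new_logits, old_logits, action, advantage, eps=0.2):
    """Clipped surrogate objective for one (state, action) sample.

    Hypothetical helper: eps=0.2 is a common default, not a value
    confirmed by this repository.
    """
    # Probability ratio between the updated and the behaviour policy
    ratio = softmax(new_logits)[action] / softmax(old_logits)[action]
    # Keep the ratio inside [1 - eps, 1 + eps]
    clipped = max(1.0 - eps, min(ratio, 1.0 + eps))
    # The minimum makes the objective a pessimistic bound on the update
    return min(ratio * advantage, clipped * advantage)
```

When the new policy already favours a good action far more than the old one did, the clipped term caps the objective, so a single batch cannot push the policy arbitrarily far.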

Requirements

conda create -n py311 python=3.11 -y
conda activate py311
pip install -r requirements.txt

Usage

Train

python train.py # after training, the training curve of the current round is shown automatically
python snake.py # evaluate latest saved model

Evaluate assigned model

python evaluate.py --weight ./model/act-weight_round3_472_82.5.pkl

Plot assigned reward log

python plotter.py --history ./logs/reward_round3_82.5.csv
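Reward curves are usually noisy, so smoothing helps when comparing rounds. The sketch below is not plotter.py: it assumes the log holds one reward value per row in the first column (the real CSV layout is not documented here), and `load_rewards` and `moving_average` are hypothetical helper names.

```python
import csv
import io


def load_rewards(handle, column=0):
    """Read one numeric column from a CSV stream.

    Assumption: rewards sit in `column`; rows that are empty or not
    numeric (such as a header) are skipped.
    """
    rewards = []
    for row in csv.reader(handle):
        try:
            rewards.append(float(row[column]))
        except (ValueError, IndexError):
            continue
    return rewards


def moving_average(values, window=10):
    """Trailing mean over at most `window` samples, same length as the input."""
    smoothed = []
    running = 0.0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            running -= values[i - window]  # drop the sample leaving the window
        smoothed.append(running / min(i + 1, window))
    return smoothed


# Usage with an in-memory stand-in for a reward log
demo = io.StringIO("reward\n1\n2\n3\n4\n")
curve = moving_average(load_rewards(demo), window=2)
```

The smoothed list can then be passed to any plotting library in place of the raw values.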

Experiments

Round	1	2	3
Training curve	(figure)	(figure)	(figure)
Evaluation	(figure)	(figure)	(figure)
Reward_eat	+2.0	+2.0	+2.0
Reward_hit	-0.5	-1.0	-1.5
Reward_bit	-0.8	-1.5	-2.0
Avg record	≈19	≈23	≈28
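The three reward settings in the table can be written down as a small lookup, sketched below. The values come from the table; everything else is an assumption for illustration: the reading of Reward_hit as a wall collision and Reward_bit as the snake biting itself, the zero reward for an ordinary step, and the names `REWARDS` and `step_reward`.

```python
# Reward settings per experiment round, copied from the table above.
REWARDS = {
    1: {"eat": 2.0, "hit": -0.5, "bit": -0.8},
    2: {"eat": 2.0, "hit": -1.0, "bit": -1.5},
    3: {"eat": 2.0, "hit": -1.5, "bit": -2.0},
}


def step_reward(event, round_id=3):
    """Return the reward for one game event under a given round's settings.

    Assumptions: "hit" means hitting a wall, "bit" means biting the body,
    and any other event (a plain move) earns 0.0.
    """
    return REWARDS[round_id].get(event, 0.0)


# Usage: the same self-collision is punished more harshly in later rounds
penalties = [step_reward("bit", r) for r in (1, 2, 3)]
```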

Conclusions

Increasing the death penalty leads to higher average records (≈19, ≈23, ≈28 across rounds 1 to 3)
The low-death-penalty configuration produces a lower reward curve during training, yet it still performs well in the demo
A particularly high reward for eating food can drive the agent toward quick gains while disregarding long-term safety

Future work

Training time is too short to show the advantages of DRL over non-DRL methods (Snaqe)
The zigzag movement of the snake body looks ugly; try adding a penalty to the reward for excessive zigzagging
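One possible way to express the zigzag penalty is to count direction changes over the last few steps and subtract a small amount for each turn beyond a tolerance. This is only a sketch of the idea: `zigzag_penalty`, the window of 8 steps, the tolerance of 3 turns, and the 0.1 penalty are all hypothetical and untuned.

```python
def zigzag_penalty(headings, window=8, max_turns=3, penalty=0.1):
    """Penalty (zero or negative) for turning too often in recent steps.

    `headings` is the sequence of movement directions, oldest first,
    e.g. ["U", "U", "R", "U"]. All default values are illustrative.
    """
    recent = headings[-window:]
    # A turn is any step whose heading differs from the previous one
    turns = sum(1 for prev, cur in zip(recent, recent[1:]) if prev != cur)
    excess = max(0, turns - max_turns)
    if excess == 0:
        return 0.0
    return -penalty * excess


# Usage: a straight run costs nothing, a staircase pattern is penalized
straight = zigzag_penalty(["U"] * 8)
staircase = zigzag_penalty(["U", "R"] * 4)
```

The result would be added to the per-step reward, so occasional turns stay free while sustained zigzagging becomes slightly costly.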

MuGemSt/Snake-AI
