Trained with Unity's implementation of PPO on various board sizes.
Shown are nine independent parallel randomized instantiations on an 8x8 board.

| Block Color | Game Object |
|---|---|
| Red | Snake Head |
| White | Snake Body |
| Yellow | Food Pellet |

Steps are incremented each time the agent takes an action. Score is incremented each time the snake eats a food pellet, which also increases its length by one. The current model consistently achieves a score of around half the number of cells on the board (e.g. a length of about 32 on an 8x8 board) but rarely gets far above that.
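
The step and score bookkeeping described above can be sketched roughly as follows. This is a minimal Python illustration and not the project's actual code: the `EpisodeStats` class, its field names, the starting length of 1, and the `half_board_score` helper are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class EpisodeStats:
    """Hypothetical per-episode counters; names are illustrative only."""
    steps: int = 0    # number of actions the agent has taken
    score: int = 0    # number of food pellets eaten
    length: int = 1   # assumed starting length (head only)

    def record_action(self, ate_food: bool) -> None:
        # Every action counts as one step, whether or not food was eaten.
        self.steps += 1
        if ate_food:
            # Eating a pellet raises the score and grows the snake by one.
            self.score += 1
            self.length += 1


def half_board_score(board_size: int) -> int:
    """Half the number of cells on a square board of the given side length."""
    return (board_size * board_size) // 2


stats = EpisodeStats()
for ate in (False, True, False, True, True):
    stats.record_action(ate)

print(stats.steps, stats.score, stats.length)  # 5 3 4
print(half_board_score(8))                     # 32
```

With five actions, three of which eat a pellet, the counters end at 5 steps and a score of 3. `half_board_score(8)` gives the 32 quoted above for an 8x8 board.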