CartPole

A simple project to try out some learning strategies on Open AI Gym's CartPole-v0 environment :

a solution based on genetic algorithm;
a k nearest-neighbors strategy.

Genetic Algorithm

The idea is simplistic: make a fixed-sized population of agents compete. At every generation, survivors are selected stochastically according to their fitness (in this case their overall reward), and mutate into the next generation.

An Agent receives information about its environment (the observation variable returned by the step() method of the environment), and makes a decision on the action to take.

In our case, and Agent's brain consists of a simple two-layer neural network, which is fed the observation output by the environment at the end of each step. The genetic algorithm aims at determining the best parameters for the neural network.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
CartPole.ipynb		CartPole.ipynb
Jupyter Notebook.command		Jupyter Notebook.command
debug.py		debug.py
genetic.py		genetic.py
knn.py		knn.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CartPole

Genetic Algorithm

k-Nearest Neighbors

About

Releases

Packages

Languages

bdura/cartpole

Folders and files

Latest commit

History

Repository files navigation

CartPole

Genetic Algorithm

k-Nearest Neighbors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages