Generalisation

@tao-pr tao-pr released this 30 Mar 14:25
· 124 commits to master since this release

Q-learning now has built-in generalisation, using gradient descent over a linear combination of policy variables. This release also adds another sample, falling stones, to demonstrate how generalisation is used.
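
As a rough illustration of the idea (not this project's actual API), the sketch below shows one common way to generalise Q-learning: approximate Q(s, a) as a dot product of a weight vector and a feature vector, then nudge the weights along the gradient of the temporal-difference error. The names `q_value`, `td_update`, and `phi`, as well as the `alpha` and `gamma` defaults, are assumptions made for this example.

```python
from typing import Callable, Hashable, List, Sequence

# Hypothetical feature map: turns a (state, action) pair into a numeric vector.
FeatureMap = Callable[[Hashable, Hashable], List[float]]


def q_value(weights: Sequence[float], features: Sequence[float]) -> float:
    """Linear approximation: Q(s, a) = sum_i w_i * phi_i(s, a)."""
    return sum(w * f for w, f in zip(weights, features))


def td_update(
    weights: Sequence[float],
    phi: FeatureMap,
    state: Hashable,
    action: Hashable,
    reward: float,
    next_state: Hashable,
    actions: Sequence[Hashable],
    alpha: float = 0.1,   # learning rate (assumed value)
    gamma: float = 0.9,   # discount factor (assumed value)
    terminal: bool = False,
) -> List[float]:
    """Return new weights after one semi-gradient Q-learning step."""
    x = phi(state, action)
    q_sa = q_value(weights, x)
    # Greedy estimate of the next state's value; zero if the episode ended.
    best_next = 0.0 if terminal else max(
        q_value(weights, phi(next_state, a)) for a in actions
    )
    td_error = reward + gamma * best_next - q_sa
    # For a linear model, the gradient of Q with respect to w is just phi(s, a).
    return [w + alpha * td_error * f for w, f in zip(weights, x)]
```

Because the weights are shared across all states, an update made in one state also shifts the estimates for states with similar features, which is what lets the agent act sensibly in states it has never visited.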