Chess Implementation #210

goshawk22 · 2020-08-30T10:46:10Z

I have a working chess implementation here. Everything works; however, because of the complexity of chess, a single self-play game takes about 10 minutes, and a game takes about 20 minutes when the agent is being evaluated. This makes training a good model unviable with the limited computing power I have available. I don't know how this would change if the MCTS were asynchronous, but at the moment it seems that it won't work.
I am currently training a model with 10 episodes of self-play and 25 MCTS sims. I also implemented a ResNet architecture in Keras based on the official paper.
Will it speed up once the neural network has been trained?
Any ideas on how I might speed this up?

evg-tyurin · 2020-08-31T08:16:47Z

Your environment is pretty slow. An episode of chess with 100 MCTS sims takes about 1-2 minutes on a mid-level GPU (like a 2070).

You could reread your code to look for potentially redundant instructions, such as this one: https://github.com/goshawk22/alpha-zero-chess/blob/0ebedbd18dd7552a45cf27dc783eeb97713db936/localchess/ChessGame.py#L46

Furthermore, you could profile your program with a Python profiler such as cProfile.
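
As a minimal sketch of what that profiling could look like with the standard-library `cProfile` and `pstats` modules: `play_episode` and `slow_legal_moves` below are hypothetical stand-ins for one self-play game and its move generation, not functions from the repo.

```python
import cProfile
import io
import pstats


def slow_legal_moves(n):
    """Hypothetical stand-in for an expensive move-generation routine."""
    return [i * i for i in range(n)]


def play_episode(num_moves=50):
    """Hypothetical stand-in for one self-play game."""
    total = 0
    for _ in range(num_moves):
        total += len(slow_legal_moves(2000))
    return total


def profile_call(func, *args, top=10):
    """Run func under cProfile; return its result and a text report
    of the `top` entries sorted by cumulative time."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(top)
    return result, buffer.getvalue()


if __name__ == "__main__":
    _, report = profile_call(play_episode)
    print(report)
```

Sorting by cumulative time tends to surface the high-level call (for example board copying or legal-move generation) that dominates an episode, which is usually the first place to look.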

Optimization aside, I see that you haven't implemented all the chess rules. If your action space is 64x64, your game doesn't know about promotion to any piece other than a queen.
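
To illustrate the promotion point, here is one possible way to extend a 64x64 action space so that underpromotions get their own indices. This is only a sketch under assumptions: the square numbering (a1 = 0 up to h8 = 63), the names `encode_action` and `decode_action`, and the four-block layout are hypothetical; they are not taken from the repo or from the encoding used in the AlphaZero paper.

```python
NUM_SQUARES = 64
BASE_ACTIONS = NUM_SQUARES * NUM_SQUARES  # 4096 (from, to) pairs

# Block 0 holds ordinary moves and queen promotions (the existing 64x64 space).
# Blocks 1-3 hold the same (from, to) pairs, tagged as underpromotions.
PROMOTION_CHOICES = (None, "n", "b", "r")
ACTION_SIZE = BASE_ACTIONS * len(PROMOTION_CHOICES)  # 16384


def encode_action(from_sq, to_sq, promotion=None):
    """Map (from, to, promotion) to a single action index."""
    if promotion == "q":
        promotion = None  # queen promotion shares the base block
    if not (0 <= from_sq < NUM_SQUARES and 0 <= to_sq < NUM_SQUARES):
        raise ValueError("square index out of range")
    if promotion not in PROMOTION_CHOICES:
        raise ValueError("unknown promotion piece")
    block = PROMOTION_CHOICES.index(promotion)
    return block * BASE_ACTIONS + from_sq * NUM_SQUARES + to_sq


def decode_action(action):
    """Inverse of encode_action; promotion None means 'queen if promoting'."""
    if not 0 <= action < ACTION_SIZE:
        raise ValueError("action index out of range")
    block, base = divmod(action, BASE_ACTIONS)
    from_sq, to_sq = divmod(base, NUM_SQUARES)
    return from_sq, to_sq, PROMOTION_CHOICES[block]


if __name__ == "__main__":
    e7, e8 = 52, 60  # under the assumed a1 = 0 numbering
    print(encode_action(e7, e8))        # queen promotion, base block
    print(encode_action(e7, e8, "n"))   # knight underpromotion
```

This layout wastes most of the extra indices, since only a handful of (from, to) pairs can ever be promotions, but it keeps the first 4096 actions identical to the existing encoding, so the legal-move mask is the only other thing that has to change.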

goshawk22 (Author) · 2020-08-31T08:53:25Z

I'll have a look to see if I can find any more bottlenecks.

goshawk22 (Author) · 2020-08-31T15:59:57Z

I wanted to get it working before implementing any of the more complicated moves, and a promotion is usually to a queen anyway.

evg-tyurin · 2020-09-01T07:24:46Z

Unfortunately, it doesn't work even at 100-200 MCTS sims x 1000 episodes. I suppose chess requires much more computation per iteration to get a noticeable improvement in performance.

Ettrig · 2020-09-01T10:21:08Z

You might also be interested in Issue #205, which points out a grave defect in the MCTS code.

goshawk22 (Author) · 2020-09-01T11:50:41Z

I was thinking of getting a free trial on Oracle Cloud and using a good GPU for 3 days or so to train it. This might get some results.
The other thing I could try is a simpler game based on chess, like pawns.
I think that with games as complex as chess, the limit eventually becomes computational power.

goshawk22 (Author) · 2020-09-01T12:30:50Z

Maybe I should base my implementation on your repo, alpha-nagibator.
Then it would have multithreading etc.?

evg-tyurin · 2020-09-01T13:02:18Z

It definitely has multithreading support, and I was able to get good enough performance in checkers compared to one of the best classical checkers programs available on the internet. I think it'll take more than 3 days of GPU time to reach the same level of performance.

goshawk22 (Author) · 2020-09-02T16:30:13Z

Does your repo play multiple games at once or run multiple MCTS simulations at once? I wonder which would give the bigger performance gain.
I think I will try playing multiple games at once using Ray, in both self-play and the arena comparison, to speed up training, and port your MCTS code over, which should improve training considerably.
I think I will then be able to train it on a P100 for 4-5 days or so (using free trials) and see what results I get.

evg-tyurin · 2020-09-02T17:07:25Z

It plays multiple games at once, each holding its own MCTS tree, and all MCTS instances communicate with a single NN model. Running multiple MCTS simulations at once is more efficient but trickier, because collisions have to be resolved; I didn't implement this.
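
The "multiple games, single NN model" idea could be sketched roughly as below. This is a simplified, single-threaded illustration and not the code from alpha-nagibator: `DummyModel`, `GameWorker`, and `run_batched` are hypothetical names, and each game's tree is reduced to a counter so the batching is easy to see.

```python
class DummyModel:
    """Hypothetical stand-in for the network.

    One call evaluates a whole batch of positions, which is where
    a GPU forward pass would happen in a real implementation.
    """

    def __init__(self):
        self.calls = 0
        self.batch_sizes = []

    def predict_batch(self, states):
        self.calls += 1
        self.batch_sizes.append(len(states))
        # Fake (policy, value) pair for every state in the batch.
        return [([0.5, 0.5], 0.0) for _ in states]


class GameWorker:
    """One game with its own search state (a real one would own an MCTS tree)."""

    def __init__(self, game_id, sims_needed):
        self.game_id = game_id
        self.sims_left = sims_needed
        self.values = []

    def done(self):
        return self.sims_left == 0

    def next_leaf(self):
        """Return the position this game's search wants evaluated next."""
        return (self.game_id, self.sims_left)

    def receive(self, policy, value):
        """Back the evaluation up into this game's own tree (here: record it)."""
        self.values.append(value)
        self.sims_left -= 1


def run_batched(workers, model):
    """Advance all games in lockstep, sending one batch per round to the model."""
    while True:
        active = [w for w in workers if not w.done()]
        if not active:
            break
        leaves = [w.next_leaf() for w in active]
        outputs = model.predict_batch(leaves)
        for worker, (policy, value) in zip(active, outputs):
            worker.receive(policy, value)


if __name__ == "__main__":
    model = DummyModel()
    games = [GameWorker(i, sims_needed=25) for i in range(8)]
    run_batched(games, model)
    print(model.calls, "model calls for", 8 * 25, "evaluations")
```

Because each game only ever touches its own tree, no collision handling is needed; the gain comes purely from the model seeing larger batches instead of one position per call.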
