I don't understand why trainExamplesHistory is not cleared between iterations #251

Racines · 2021-09-08T20:11:54Z

Hello,

I see that trainExamplesHistory in Coach.py is never cleared, even when a new model is accepted after the pit (line 126).
I don't understand why we keep the previous training data: the stored policy (pi) and result value (v) would not be the same if re-evaluated by the new model.
It looks like we keep training the new model on stale data.

Can someone explain the reason why?

yunjiangster · 2021-10-31T05:55:29Z

Using data from earlier iterations could help smooth training progress and add diversity, since the earlier models may be only slightly suboptimal compared to the most recent one.
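To make the idea concrete, here is a minimal sketch of one common way to reuse older self-play data without keeping it forever: a bounded window over the last few iterations. This is not the code in Coach.py; `ReplayHistory`, `max_history_iters`, and the `(board, pi, v)` tuple layout are hypothetical names and assumptions chosen for illustration.

```python
import random
from collections import deque


class ReplayHistory:
    """Keeps self-play examples from only the most recent iterations.

    Hypothetical helper for illustration; not part of the repository.
    """

    def __init__(self, max_history_iters):
        # One entry per iteration. With maxlen set, appending to a full
        # deque discards the oldest iteration automatically.
        self.iterations = deque(maxlen=max_history_iters)

    def add_iteration(self, examples):
        # Store a copy so later changes to the caller's list do not leak in.
        self.iterations.append(list(examples))

    def training_set(self, rng=None):
        # Flatten every retained iteration into one list and shuffle it,
        # so old and new examples are mixed within the training batches.
        rng = rng or random.Random()
        flat = [ex for iteration in self.iterations for ex in iteration]
        rng.shuffle(flat)
        return flat


# Assumed example layout: (board, pi, v)
history = ReplayHistory(max_history_iters=2)
history.add_iteration([("board_a", [0.5, 0.5], 1)])
history.add_iteration([("board_b", [0.9, 0.1], -1)])
history.add_iteration([("board_c", [0.2, 0.8], 1)])  # iteration with board_a is dropped
examples = history.training_set()
```

With a window like this, the oldest (most stale) examples eventually fall out, while examples from the last few models still contribute. The window size trades freshness against diversity.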

Racines changed the title ~~I don't understand why trainExamplesHistory is not clear between iterations~~ I don't understand why trainExamplesHistory is not cleared between iterations Sep 8, 2021
