I see that trainExamplesHistory in Coach.py is never cleared, even when a new model is accepted after the pit (line 126).
I don't understand why we keep the previous training data: the stored policy (pi) and value (v) targets would not be the same if they were re-evaluated with the new model.
It looks like we keep training the new model on outdated data.
Can someone explain why?
Racines changed the title from "I don't understand why trainExamplesHistory is not clear between iterations" to "I don't understand why trainExamplesHistory is not cleared between iterations" on Sep 8, 2021
Using data from earlier iterations can help smooth training progress and add diversity to the training set, since the earlier models may be only slightly weaker than the most recent one.
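To make the idea concrete, here is a minimal sketch of keeping a bounded window of recent self-play iterations instead of clearing the history every time. It is not the code from Coach.py: the names `build_training_set`, `history`, and the window size of 3 are hypothetical, and the examples are dummy `(board, pi, v)` tuples. The assumption is that examples older than a few iterations are dropped while the rest are mixed with the newest ones.

```python
import random
from collections import deque


def build_training_set(history, new_examples, rng):
    """Add one iteration's examples to the window and return a shuffled mix.

    `history` is a deque with maxlen set, so appending a new iteration
    silently drops the oldest one once the window is full.
    """
    history.append(new_examples)
    # Flatten all retained iterations into a single list of examples.
    flat = [example for iteration in history for example in iteration]
    rng.shuffle(flat)
    return flat


# Hypothetical window: keep only the 3 most recent iterations.
history = deque(maxlen=3)
rng = random.Random(0)

for i in range(5):
    # Dummy (board, pi, v) examples standing in for real self-play data.
    examples = [(f"board_{i}_{k}", [0.5, 0.5], 1) for k in range(2)]
    train_set = build_training_set(history, examples, rng)

# After 5 iterations, only iterations 2, 3 and 4 remain in the window.
iterations_kept = sorted({board.split("_")[1] for board, _, _ in train_set})
print(len(train_set), iterations_kept)
```

With a window like this, slightly stale targets from recent models still contribute, while data from much older and weaker models eventually falls out.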