Replies: 2 comments 6 replies
-
You appear to not be performing the loss.backward(), optimizer.step(), or optimizer.zero_grad() calls which are needed in PyTorch training loops. Could this be causing your issue? |
Beta Was this translation helpful? Give feedback.
4 replies
-
@liecn Please see the following gist for ETCI2021 train/val script which uses raw PyTorch. I didn't experience any NaN loss or 0 train/val accuracy. Let me know if you have any further issues. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I am trying to implement the ETCI2021 training in federated learning. However, the test accuracy is 0.9 after one-round training and then changes to Nan.
I used the scripts to download the dataset (train and val) and adopt the training strategy in the repo, including the loss, optimizer, model, and hyperparameters as below. Thanks for your help. I doubt the lr is too large, so I am working on adapting it.
Beta Was this translation helpful? Give feedback.
All reactions