Improvements to the resnet cifar10 example #490

Merged: 1 commit into google:main from resnet_cifar10 on Aug 29, 2023

Conversation

@fabianp (Collaborator) commented on Aug 2, 2023

  1. Reaches 91% accuracy with a ResNet18 model on CIFAR10, where before
     we were only getting 88%. The main change that made this possible was
     shrinking the convolutional kernel in the first layer from the 7x7
     default to 3x3. This is documented in the code, which now makes it
     easy to override the default kernel size (see the stem-convolution
     sketch after this list).

  2. Runs in 50 epochs (vs. 200 before) and takes only 13 minutes on a free
     Colab instance, vs. 1 hour before.

  3. Removed many unused variables, and cleaned up the code.

  4. I'm now using a warm-up learning-rate schedule, which seems to work
     well across different architectures and datasets (see the schedule
     sketch after this list).
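
For item 1, here is a minimal Flax sketch of the idea (illustrative only, not the example's exact code; the `Stem` class name and the 64-filter default are assumptions): the stem convolution takes its kernel size as a module field, so the ImageNet-style 7x7 default can be overridden with 3x3 for 32x32 CIFAR10 images.

```python
# Minimal sketch (not the example's exact code): a Flax module whose stem
# convolution takes its kernel size as a field, so the ImageNet-style 7x7
# default can be overridden with 3x3 for 32x32 CIFAR10 images.
from typing import Tuple

import flax.linen as nn
import jax
import jax.numpy as jnp


class Stem(nn.Module):
  num_filters: int = 64
  kernel_size: Tuple[int, int] = (7, 7)  # ImageNet-style default

  @nn.compact
  def __call__(self, x):
    # A 3x3 stem keeps more spatial resolution on small CIFAR10 images than
    # the 7x7 stem designed for 224x224 ImageNet inputs.
    x = nn.Conv(self.num_filters, self.kernel_size,
                padding='SAME', use_bias=False)(x)
    return nn.relu(x)


# Override the 7x7 default with a 3x3 kernel for CIFAR10.
stem = Stem(kernel_size=(3, 3))
params = stem.init(jax.random.PRNGKey(0), jnp.ones((1, 32, 32, 3)))
```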
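
For item 4, a minimal sketch of a warm-up schedule with optax; the peak value, batch size, and the cosine decay after warm-up are assumptions for illustration, not necessarily what the example uses.

```python
# Minimal sketch of a warm-up learning-rate schedule with optax. The peak
# value (0.1), batch size (128), and cosine decay after warm-up are
# illustrative assumptions.
import optax

num_epochs = 50
steps_per_epoch = 50_000 // 128           # CIFAR10 train set / assumed batch size
total_steps = num_epochs * steps_per_epoch

schedule = optax.warmup_cosine_decay_schedule(
    init_value=0.0,                # start from zero,
    peak_value=0.1,                # linearly warm up to the peak learning rate,
    warmup_steps=steps_per_epoch,  # over roughly one epoch,
    decay_steps=total_steps,       # then cosine-decay for the rest of training.
)

# The schedule plugs into any optax optimizer, e.g. SGD with momentum.
tx = optax.sgd(learning_rate=schedule, momentum=0.9)
```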

@fabianp force-pushed the resnet_cifar10 branch 2 times, most recently from b657aad to 8faea3d on August 8, 2023 19:58
@fabianp changed the title from "[work in progress, not ready for review] Random improvements to the resnet cifar10 example" to "Improvements to the resnet cifar10 example" on Aug 28, 2023
@fabianp marked this pull request as ready for review on August 28, 2023 13:43
@fabianp requested a review from @mblondel on August 28, 2023 13:43
@mblondel (Collaborator) left a comment:

Thank you so much for the amazing improvements. LGTM apart from the minor comment below.

On the line ModuleDef = Any:
@mblondel (Collaborator):
What do we gain from this definition? It wasn't entirely clear to me while reading the code. Maybe add a quick comment explaining why you introduce this.

@fabianp (Collaborator, Author):
TBH I'm doing it because Flax does it that way (https://github.com/google/flax/blob/main/examples/imagenet/models.py). I guess conv: ModuleDef is more explicit (for a human) than conv: Any, although a type checker treats them the same.
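
For readers following along, a minimal sketch of the pattern, adapted from the Flax ImageNet example linked above rather than from this PR's code (the Block class and its fields are illustrative): the conv/norm classes are passed in as dataclass fields, and ModuleDef is only a readable alias for Any.

```python
# Minimal sketch of the pattern, adapted from the Flax ImageNet example
# rather than this PR's exact code: the conv/norm classes are dataclass
# fields, and ModuleDef is only a readable alias for Any.
from functools import partial
from typing import Any, Callable

import flax.linen as nn

ModuleDef = Any


class Block(nn.Module):
  filters: int
  conv: ModuleDef          # reads better for a human than `conv: Any`;
  norm: ModuleDef          # a type checker treats both the same.
  act: Callable = nn.relu

  @nn.compact
  def __call__(self, x):
    y = self.conv(self.filters, (3, 3))(x)
    y = self.norm()(y)
    return self.act(y)


# The caller decides which concrete modules to plug in.
block = Block(filters=64,
              conv=partial(nn.Conv, use_bias=False),
              norm=partial(nn.BatchNorm, use_running_average=True))
```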

@mblondel (Collaborator):
If that's the Flax way, fine with me!

@copybara-service bot merged commit 75724b5 into google:main on Aug 29, 2023
5 checks passed