Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improvements to the resnet cifar10 example #490

Merged
merged 1 commit into from
Aug 29, 2023

Commits on Aug 28, 2023

  1. Many improvements to the resnet flax example, including:

    1. Reaches 91% accuracy with a resnet18 model on CIFAR10, before
    we were only getting 88%. The main change that made this possible was
    to tweak the size of the convolutional kernel in the first layer, from
    7x7 (default) to 3x3. This is documented in the code, which now allows
    easily to overwrite the default kernel size.
    
    2. Runs in 50 epochs (vs 200 before), and only takes 13 min on free
    colab, vs 1 hour before.
    
    3. Removed many unused variables, and cleaned up the code.
    
    4. I'm now using a warm-up learning rate, which seems to be working
    great across different architectures and datasets.
    fabianp committed Aug 28, 2023
    Configuration menu
    Copy the full SHA
    30d38e7 View commit details
    Browse the repository at this point in the history