You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Basic MNIST FP8 training implemented in #87
Training is working as expected, even though hard to make any conclusion from just this simple MNIST model.
We need to validate
AutoScale
FP8 training on the basic MNIST example to start with.With multiple experiments:
FP8-143
format used;FP8-152
format used;These experiments can be done relying primarily on JAX LAX
reduce_precision
operator. No need of FP8 hardware for now.The text was updated successfully, but these errors were encountered: