Skip to content

Reorder load and scaling code to allow latency hidding for block-wise scaled GEMMs #3728

Reorder load and scaling code to allow latency hidding for block-wise scaled GEMMs

Reorder load and scaling code to allow latency hidding for block-wise scaled GEMMs #3728

Annotations

2 warnings

pytorch/FBGEMM  /  ...  /  manywheel-py3_8-cuda12_1

succeeded May 20, 2024 in 41s