Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[MLAS][AArch64] SQNBitGemm CompInt8 - Use 4x2 tiles (#21380)
Update SQNBitGemm ARM NEON kernel to compute 4x2 tile of output. Note: Also tried 2x4 and 4x4 tiles but observed the best microbenchmark results with 4x2 tiles.
- Loading branch information