[TUTORIALS] persistent kernel - fp8 matmul (#4099)
Includes a performance comparison between a naive matmul (an improved
version of the tutorial matmul), the cuBLAS implementation, and persistent
kernels without and with TMA.
pawelszczerbuk authored Jun 8, 2024
1 parent da99faf commit 85c7e15
Showing 2 changed files with 406 additions and 1 deletion.