Skip to content

Commit

Permalink
Turn on TMA by default for row-wise GEMM (#2450)
Browse files Browse the repository at this point in the history
Summary:
Pull Request resolved: #2450

X-link: facebookresearch/FBGEMM#189

Enabling the TMA row-wise GEMM by default it TMA appears to give quite some speedup across-the-board, up to 40% for some shapes.

Reviewed By: choutim

Differential Revision: D62212842

fbshipit-source-id: 59220cec90e222fe91be9f53a3477f1c38e02e2a
  • Loading branch information
htyu authored and facebook-github-bot committed Sep 10, 2024
1 parent 5ee3599 commit e95390a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion torchbenchmark/operators/fp8_gemm_rowwise/operator.py
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ def parse_args(args: List[str]) -> argparse.Namespace:
parser.add_argument(
"--no_fp8_fast_accum", dest="fp8_fast_accum", action="store_false"
)
parser.add_argument("--use_tma", action="store_true")
parser.add_argument("--no_use_tma", dest="use_tma", action="store_false")
args = parser.parse_args(args)
return args

Expand Down

0 comments on commit e95390a

Please sign in to comment.