Added Gemma-specific fp8 quantization file
yeonsily committed Oct 21, 2024
1 parent c625680 commit 2376b58
Showing 1 changed file with 12 additions and 0 deletions.
@@ -0,0 +1,12 @@
{
    "method": "HOOKS",
    "mode": "QUANTIZE",
    "observer": "maxabs",
    "scale_method": "maxabs_hw",
    "blocklist": {"types": [], "names": [
        "matmul_qk",
        "matmul_av",
        "lm_head"
    ]},
    "dump_stats_path": "./hqt_output/measure"
}
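
To illustrate how the `blocklist` in the file above might be read, here is a minimal Python sketch that parses the config and reports which module names would be left unquantized. The helpers `load_config` and `is_blocked`, the example module names, and the substring-matching rule are all assumptions made for illustration; the commit does not show how the quantization toolkit actually matches blocklist entries.

```python
import json

# Config text copied from the file added in this commit.
CONFIG_TEXT = """
{
    "method": "HOOKS",
    "mode": "QUANTIZE",
    "observer": "maxabs",
    "scale_method": "maxabs_hw",
    "blocklist": {"types": [], "names": [
        "matmul_qk",
        "matmul_av",
        "lm_head"
    ]},
    "dump_stats_path": "./hqt_output/measure"
}
"""


def load_config(text):
    """Parse the JSON config and check the blocklist shape (hypothetical helper)."""
    cfg = json.loads(text)
    blocklist = cfg.get("blocklist", {})
    for key in ("types", "names"):
        if not isinstance(blocklist.get(key, []), list):
            raise ValueError(f"blocklist.{key} must be a list")
    return cfg


def is_blocked(module_name, cfg):
    """Return True if module_name contains any blocklisted name.

    Assumption: entries are matched as substrings of the module name.
    The real toolkit may use a different matching rule.
    """
    names = cfg.get("blocklist", {}).get("names", [])
    return any(pattern in module_name for pattern in names)


cfg = load_config(CONFIG_TEXT)

# Hypothetical module names, chosen only to exercise the blocklist.
candidates = [
    "model.layers.0.self_attn.matmul_qk",
    "model.layers.0.self_attn.matmul_av",
    "model.layers.0.mlp.down_proj",
    "lm_head",
]
skipped = [name for name in candidates if is_blocked(name, cfg)]
print("kept in higher precision:", skipped)
```

Under the substring assumption, the two attention matmuls and `lm_head` are skipped, while `model.layers.0.mlp.down_proj` remains eligible for fp8 quantization.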
