Skip to content

Commit

Permalink
Speed up KV in llamafile-bench
Browse files Browse the repository at this point in the history
  • Loading branch information
jart committed Aug 29, 2024
1 parent d2be732 commit c7c4d65
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions llama.cpp/llama-bench/llama-bench.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -265,8 +265,8 @@ static const cmd_params cmd_params_defaults = {
/* n_pg */ {},
/* n_batch */ {2048},
/* n_ubatch */ {512},
/* type_k */ {GGML_TYPE_F16},
/* type_v */ {GGML_TYPE_F16},
/* type_k */ {X86_HAVE(AVX512_BF16) ? GGML_TYPE_BF16 : GGML_TYPE_F16},
/* type_v */ {X86_HAVE(AVX512_BF16) ? GGML_TYPE_BF16 : GGML_TYPE_F16},
/* n_threads */ {cpu_get_num_math()},
/* n_gpu_layers */ {0},
/* split_mode */ {LLAMA_SPLIT_MODE_LAYER},
Expand Down

1 comment on commit c7c4d65

@Djip007
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

Please sign in to comment.