chore: minor
chenzhuofu committed Sep 5, 2024
1 parent 13850bb commit 30d17a2
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/ops/inc_multihead_self_attention.cu
@@ -506,7 +506,8 @@ IncMultiHeadSelfAttentionMeta::IncMultiHeadSelfAttentionMeta(
   size_t max_post_pos_enc_pages =
       round_up_pages(BatchConfig::MAX_STREAMING_POS -
                      BatchConfig::get_max_tree_depth() +
-                     BatchConfig::max_spec_tree_token_num());
+                     max(BatchConfig::max_tokens_per_batch(),
+                         BatchConfig::max_spec_tree_token_num()));
   key_cache_size = num_kv_heads * qk_dim *
                    BatchConfig::max_requests_per_batch() *
                    max_post_pos_enc_pages * kPagesize;
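
The commit message gives no rationale, but the change appears to size the post-positional-encoding pages for whichever is larger: the per-batch token limit or the speculative tree token limit. The sketch below compares the page count before and after the change. It is an illustration only: `CacheLimits`, `pages_before`, `pages_after`, and the value `kPagesize = 64` are hypothetical, and `round_up_pages` is assumed to be a ceiling division by the page size, which the diff does not show.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for the BatchConfig values used in the diff.
// The real values are not shown in the commit.
struct CacheLimits {
  std::size_t max_streaming_pos;       // BatchConfig::MAX_STREAMING_POS
  std::size_t max_tree_depth;          // BatchConfig::get_max_tree_depth()
  std::size_t max_tokens_per_batch;    // BatchConfig::max_tokens_per_batch()
  std::size_t max_spec_tree_token_num; // BatchConfig::max_spec_tree_token_num()
};

// Assumed page size; the actual kPagesize is defined elsewhere in the repo.
constexpr std::size_t kPagesize = 64;

// Assumption: round_up_pages is a ceiling division of tokens by page size.
constexpr std::size_t round_up_pages(std::size_t num_tokens) {
  return (num_tokens + kPagesize - 1) / kPagesize;
}

// Page count as computed before the commit.
std::size_t pages_before(CacheLimits const &c) {
  assert(c.max_streaming_pos >= c.max_tree_depth); // avoid unsigned underflow
  return round_up_pages(c.max_streaming_pos - c.max_tree_depth +
                        c.max_spec_tree_token_num);
}

// Page count as computed after the commit: the last term is now the larger
// of the per-batch token limit and the speculative tree token limit.
std::size_t pages_after(CacheLimits const &c) {
  assert(c.max_streaming_pos >= c.max_tree_depth); // avoid unsigned underflow
  return round_up_pages(
      c.max_streaming_pos - c.max_tree_depth +
      std::max(c.max_tokens_per_batch, c.max_spec_tree_token_num));
}
```

With these assumed numbers, the two versions differ only when `max_tokens_per_batch` exceeds `max_spec_tree_token_num`; otherwise `std::max` selects the same term as before and the page count is unchanged.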