-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test: Test model_metrics config and document histogram buckets override #7752
Conversation
64a5623
to
65c736e
Compare
65c736e
to
4af437e
Compare
|
||
SERVER_ARGS="--model-repository=${MODELDIR} --model-control-mode=explicit --load-model=${decoupled_model} --metrics-config histogram_latencies=true --log-verbose=1" | ||
run_and_check_server | ||
export OVERRIDE_BUCKETS="-1,0,1,2.5,+Inf" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You are still hard-coding the buckets value..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hard coded here because it is non-trivial work to convert a double in string to the displayed bucket "le" value in Prometheus.
0.00 -> le="0"
1.0100 -> le="1.01"
100 -> le="100"
000 -> le="0"
1. -> le="1"
qa/L0_model_config/model_metrics/valid_config_with_warn/unknown_metric_family/expected
Outdated
Show resolved
Hide resolved
… yinggeh-DLIS-7457-per-model-metric-config
What does the PR do?
Tests new model_metrics message in pbtxt. Add example to override histogram buckets per-family.
Checklist
<commit_type>: <Title>
Commit Type:
Check the conventional commit type
box here and add the label to the github PR.
Related PRs:
triton-inference-server/core#405
triton-inference-server/common#126
Where should the reviewer start?
Test plan:
L0_metrics
20106057
Caveats:
Background
Default histogram buckets does not satisfy all use cases.