Update vllm backend to support offline and online serving modes #319
test_cli_cuda_tensorrt_llm.yaml
on: pull_request
cli_cuda_tensorrt_llm_tests
46s
Annotations
2 errors
cli_cuda_tensorrt_llm_tests
Canceling since a higher priority waiting request for 'CLI CUDA TensorRT-LLM Tests-232' exists
|
cli_cuda_tensorrt_llm_tests
The operation was canceled.
|