Skip to content

Actions: huggingface/optimum-benchmark

CLI CUDA TensorRT-LLM Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
425 workflow runs
425 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Refactor backends and add load tracking
CLI CUDA TensorRT-LLM Tests #301: Pull request #227 synchronize by IlyasMoutawwakil
July 11, 2024 08:55 5m 31s refactor-backends
July 11, 2024 08:55 5m 31s
Refactor backends and add load tracking
CLI CUDA TensorRT-LLM Tests #300: Pull request #227 synchronize by IlyasMoutawwakil
July 11, 2024 08:43 5m 46s refactor-backends
July 11, 2024 08:43 5m 46s
Refactor backends and add load tracking
CLI CUDA TensorRT-LLM Tests #299: Pull request #227 synchronize by IlyasMoutawwakil
July 11, 2024 08:34 5m 51s refactor-backends
July 11, 2024 08:34 5m 51s
Refactor backends and add load tracking
CLI CUDA TensorRT-LLM Tests #298: Pull request #227 opened by IlyasMoutawwakil
July 11, 2024 08:09 5m 50s refactor-backends
July 11, 2024 08:09 5m 50s
Per token latency outliers (#225)
CLI CUDA TensorRT-LLM Tests #297: Commit 7999050 pushed by IlyasMoutawwakil
July 3, 2024 15:28 11m 31s main
July 3, 2024 15:28 11m 31s
Per token latency outliers
CLI CUDA TensorRT-LLM Tests #296: Pull request #225 opened by IlyasMoutawwakil
July 3, 2024 15:24 10m 31s better-per-token-latency
July 3, 2024 15:24 10m 31s
Patch release (#224)
CLI CUDA TensorRT-LLM Tests #295: Commit 8ebe853 pushed by IlyasMoutawwakil
July 3, 2024 14:45 11m 22s main
July 3, 2024 14:45 11m 22s
Patch release
CLI CUDA TensorRT-LLM Tests #294: Pull request #224 opened by IlyasMoutawwakil
July 3, 2024 14:45 7m 50s IlyasMoutawwakil-patch-2
July 3, 2024 14:45 7m 50s
Fix per token latency (#223)
CLI CUDA TensorRT-LLM Tests #293: Commit 2a75c0b pushed by IlyasMoutawwakil
July 3, 2024 14:05 10m 26s main
July 3, 2024 14:05 10m 26s
Fix per token latency
CLI CUDA TensorRT-LLM Tests #292: Pull request #223 opened by IlyasMoutawwakil
July 3, 2024 13:33 10m 20s fix-per-token-latency
July 3, 2024 13:33 10m 20s
bump version 0.3.0 (#221)
CLI CUDA TensorRT-LLM Tests #291: Commit 19eeac5 pushed by IlyasMoutawwakil
July 2, 2024 10:15 14m 3s main
July 2, 2024 10:15 14m 3s
bump version 0.3.0
CLI CUDA TensorRT-LLM Tests #290: Pull request #221 opened by IlyasMoutawwakil
July 2, 2024 10:15 8m 3s IlyasMoutawwakil-patch-1
July 2, 2024 10:15 8m 3s
Fix INC (#220)
CLI CUDA TensorRT-LLM Tests #289: Commit 3731aa1 pushed by IlyasMoutawwakil
July 2, 2024 09:57 10m 17s main
July 2, 2024 09:57 10m 17s
Fix INC
CLI CUDA TensorRT-LLM Tests #288: Pull request #220 synchronize by IlyasMoutawwakil
July 2, 2024 09:42 10m 8s fix-inc
July 2, 2024 09:42 10m 8s
Fix INC
CLI CUDA TensorRT-LLM Tests #287: Pull request #220 synchronize by IlyasMoutawwakil
July 2, 2024 09:18 10m 21s fix-inc
July 2, 2024 09:18 10m 21s
Fix INC
CLI CUDA TensorRT-LLM Tests #286: Pull request #220 opened by IlyasMoutawwakil
July 1, 2024 18:07 10m 23s fix-inc
July 1, 2024 18:07 10m 23s
Pin eager attn in torch-ort backend (#219)
CLI CUDA TensorRT-LLM Tests #285: Commit dd02f26 pushed by IlyasMoutawwakil
July 1, 2024 16:40 10m 26s main
July 1, 2024 16:40 10m 26s
Pin eager attn in torch-ort backend
CLI CUDA TensorRT-LLM Tests #284: Pull request #219 opened by IlyasMoutawwakil
July 1, 2024 16:22 9m 5s fix-torch-ort-attn
July 1, 2024 16:22 9m 5s
Fix PyTorchBackend TP vs DP inputs distribution across replicas and…
CLI CUDA TensorRT-LLM Tests #283: Commit 156844a pushed by IlyasMoutawwakil
July 1, 2024 16:21 10m 23s main
July 1, 2024 16:21 10m 23s
Fix PyTorchBackend TP vs DP inputs distribution across replicas and shards
CLI CUDA TensorRT-LLM Tests #282: Pull request #218 synchronize by IlyasMoutawwakil
July 1, 2024 15:40 10m 19s fix-tp-vs-dp-input-split
July 1, 2024 15:40 10m 19s
Fix PyTorchBackend TP vs DP inputs distribution across replicas and shards
CLI CUDA TensorRT-LLM Tests #281: Pull request #218 synchronize by IlyasMoutawwakil
July 1, 2024 15:11 10m 22s fix-tp-vs-dp-input-split
July 1, 2024 15:11 10m 22s
Fix PyTorchBackend TP vs DP inputs distribution across replicas and shards
CLI CUDA TensorRT-LLM Tests #280: Pull request #218 synchronize by IlyasMoutawwakil
July 1, 2024 13:12 10m 31s fix-tp-vs-dp-input-split
July 1, 2024 13:12 10m 31s