Add t4 for llm perf leaderboard #238

Merged: 18 commits merged into main on Aug 19, 2024
Conversation

baptistecolle (Collaborator) commented Aug 1, 2024

Summary

This PR adds the NVIDIA T4 to the LLM-Perf Leaderboard.

Fixes

  • Fixed the trust_remote_code handling that was broken in the CI/CD pipeline.

Features

  • Added a new machine configuration (T4) to run the LLM performance benchmarking code.
  • Updated the A100 configuration to use the new runs-on definition (without tags); see the sketch after this list.
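
For illustration, here is a minimal sketch of the two runs-on styles, assuming the "without tags" change refers to GitHub Actions runner groups; the labels and the group name below are hypothetical, not the actual values used in this repo:

# Tag/label-based selection (older style; hypothetical labels):
runs-on: [self-hosted, single-gpu, nvidia-gpu, t4]

# Group-based selection (newer style, without tags; hypothetical group name):
runs-on:
  group: aws-g4dn-2xlarge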

Related PRs and Discussions

Workflow Trigger Changes

One thing I am unsure about: I modified the workflow trigger. To reduce unnecessary compute, the workflow can now be triggered manually, and it also runs on each new release of the repo.

I think this could be better. What do you think, @IlyasMoutawwakil?

@baptistecolle changed the title from "WIP: Add t4 for llm perf leaderboard" to "Add t4 for llm perf leaderboard" on Aug 8, 2024
@baptistecolle marked this pull request as ready for review on August 12, 2024
Comment on lines 4 to 6 of the workflow file:
workflow_dispatch: # Manual trigger
release: # Trigger on new release
  types: [published]
IlyasMoutawwakil (Member) commented Aug 15, 2024

I don't think this needs commenting, and why on release?

baptistecolle (Collaborator, Author)

OK, I can remove the comments.

Good question. I think it would be more efficient to run the full benchmark with each release of the pip package, rather than on a daily basis. Running it daily seems wasteful, as the hardware remains unchanged and we’re simply repeating the benchmark for every code change. Since users are likely to benchmark using the PyPI package, it makes more sense to align this workflow with each release. We could also run the benchmarks manually if we discover any issues with them. However, if you prefer running the benchmark daily, I can revert to that schedule. Just let me know your preference.

IlyasMoutawwakil (Member)

I guess there's a misunderstanding. The daily trigger runs different benchmarks (different model + optimization + quantization combinations) each time because it skips configurations that have already been benchmarked. It is also a way to benchmark all configurations without being limited by the 6-hour time constraint of the runners.
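
For reference, a minimal sketch of what such a daily trigger looks like in a GitHub Actions workflow; the cron time below is hypothetical, and the skipping of already benchmarked configurations presumably happens in the benchmarking scripts rather than in the trigger itself:

on:
  workflow_dispatch: # manual trigger for ad hoc runs
  schedule:
    - cron: "0 3 * * *" # hypothetical time: one run per day, each picking up not-yet-benchmarked configurations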

baptistecolle (Collaborator, Author)

Thanks for the explanation, it makes much more sense now. I removed the release trigger and kept the original daily schedule.

IlyasMoutawwakil merged commit 0b69851 into main on Aug 19, 2024
25 of 28 checks passed