
Add Qwen handler and fix mean_latency calculation error for OSS models #642

Merged · 8 commits merged into ShishirPatil:main on Oct 5, 2024

Conversation

zhangch-ss (Contributor)

Hello, there may be an anomaly in the mean-latency calculation for OSS models; I have attempted to fix it.

@zhangch-ss (Contributor, Author)

See line 1045 of berkeley-function-call-leaderboard/bfcl/eval_checker/eval_runner_helper.py.
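The exact diff isn't quoted in the thread. As a hedged sketch of the kind of fix the PR title describes, averaging per-entry latencies rather than accumulating a total (all names below are hypothetical, not the actual eval_runner_helper.py code):

import statistics

# Hypothetical sketch only; the function and variable names are
# illustrative and not taken from eval_runner_helper.py.
def compute_mean_latency(latencies: list[float]) -> float:
    # Average the per-entry latencies (in seconds); accidentally
    # summing them instead would inflate the reported mean by the
    # number of entries.
    if not latencies:
        return 0.0
    return round(statistics.mean(latencies), 2)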

@zhangch-ss (Contributor, Author)

The Qwen handler is also added in this PR.
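For readers following along, here is a minimal sketch of what an OSS-style Qwen handler could look like; the class shape, method names, and constructor signature are assumptions modeled on the usual BFCL handler pattern rather than the exact code merged in this PR (the ChatML <|im_start|>/<|im_end|> delimiters are Qwen's standard chat template):

# Hypothetical sketch; in BFCL this would subclass the OSS handler
# base class, which is omitted here to keep the example self-contained.
class QwenHandler:
    def __init__(self, model_name: str, temperature: float = 0.7,
                 top_p: float = 1.0, max_tokens: int = 1000) -> None:
        self.model_name = model_name
        self.temperature = temperature
        self.top_p = top_p
        self.max_tokens = max_tokens

    @staticmethod
    def _format_prompt(question: str, functions: str) -> str:
        # Wrap the query in Qwen's ChatML template.
        system = (
            "You are a helpful assistant with access to the following "
            f"functions: {functions}"
        )
        return (
            f"<|im_start|>system\n{system}<|im_end|>\n"
            f"<|im_start|>user\n{question}<|im_end|>\n"
            "<|im_start|>assistant\n"
        )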

@zhangch-ss changed the title from "Fix the mean_latency calculation error in the OSS model" to "Add Qwen handler and fix mean_latency calculation error for OSS models" on Sep 19, 2024
@zhangch-ss (Contributor, Author)

You may need to adjust these latency values to match your actual measurements:

"Qwen/Qwen2-1.5B-Instruct": 100,
"Qwen/Qwen2-7B-Instruct": 100,
OSS_LATENCY = {
    "deepseek-ai/deepseek-coder-6.7b-instruct": 909,
    "google/gemma-7b-it": 95,
    "NousResearch/Hermes-2-Pro-Mistral-7B": 135,
    "NousResearch/Hermes-2-Pro-Llama-3-8B": 77,
    "NousResearch/Hermes-2-Theta-Llama-3-8B": 73,
    "NousResearch/Hermes-2-Theta-Llama-3-70B": 716,
    "NousResearch/Hermes-2-Pro-Llama-3-70B": 674,
    "meta-llama/Meta-Llama-3-8B-Instruct": 73,
    "meta-llama/Meta-Llama-3-70B-Instruct": 307,
    "gorilla-openfunctions-v2": 83,
    "THUDM/glm-4-9b-chat": 223,
    "Qwen/Qwen2-1.5B-Instruct": 100,
    "Qwen/Qwen2-7B-Instruct": 100,
}
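For context, one hedged reading of how such a table might be consumed: if the values are total wall-clock latencies for a full benchmark run, the per-entry mean would come from dividing by the dataset size. Both that interpretation and the entry count below are assumptions, not confirmed by this thread:

from typing import Optional

NUM_TEST_ENTRIES = 1700  # assumed dataset size; adjust to the real benchmark

def mean_latency_for(model_name: str) -> Optional[float]:
    # Look up the model's recorded total latency and convert it to a
    # per-entry mean; returns None for models missing from the table.
    total = OSS_LATENCY.get(model_name)
    if total is None:
        return None
    return round(total / NUM_TEST_ENTRIES, 2)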

@zhangch-ss (Contributor, Author)

It now supports the newly released Qwen2.5 models.
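Supporting Qwen2.5 presumably amounts to routing the new checkpoint names to the same handler; a hypothetical illustration (the handler_map name and structure are assumptions, not the exact BFCL code):

# Hypothetical registration of the new Qwen2.5 checkpoints.
handler_map = {
    "Qwen/Qwen2.5-1.5B-Instruct": QwenHandler,
    "Qwen/Qwen2.5-7B-Instruct": QwenHandler,
}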

@HuanzhiMao (Collaborator)

Apologies for the long delay. I will definitely take a look after the Monday ICLR deadline.

@zhangch-ss (Contributor, Author)

> Apologies for the long delay. I will definitely take a look after the Monday ICLR deadline.

Thanks, hope all is well!

@HuanzhiMao (Collaborator) left a comment

Thanks for the PR @zhangch-ss, and looking forward to the amazing Qwen models.
I made a few changes to your PR so that the handler is compatible with the latest codebase.
Regarding the mean-latency calculation, I will update that in a separate PR, as it involves quite a bit of change.

@HuanzhiMao added the "BFCL-New Model" (Add New Model to BFCL) label on Oct 5, 2024
@ShishirPatil merged commit b4edd01 into ShishirPatil:main on Oct 5, 2024
ShishirPatil pushed a commit that referenced this pull request Oct 21, 2024
This PR updates the leaderboard to reflect the score changes resulting from the following PR merges:

1. #660 
2. #661
3. #683
4. #679
5. #708 
6. #709
7. #701
8. #657 
9. #658 
10. #640 
11. #653
12. #642 
13. #696 
14. #667

Close #662.

Note: Some models (like `firefunction`, `functionary`, and `microsoft/phi`) are not included in this leaderboard update because we don't have all of their entries generated. We will add them back once the full results are generated.
VishnuSuresh27 pushed a commit to VishnuSuresh27/gorilla that referenced this pull request Nov 11, 2024
Add Qwen handler and fix mean_latency calculation error for OSS models (ShishirPatil#642)

Hello, there may be an anomaly in the mean-latency calculation for OSS models; I have attempted to fix it.

---------

Co-authored-by: ai_user <ai@digitalchina.com>
Co-authored-by: Huanzhi (Hans) Mao <huanzhimao@gmail.com>
Labels: BFCL-New Model (Add New Model to BFCL)
3 participants