add `npu_group_size` for `transformers_int4_npu_win` in all-in-one benchmark api #12316

ch1y0q · 2024-11-01T10:13:53Z

small bugfix

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

N/A
Unit test: Please manually trigger the PR Validation here by inputting the PR number (e.g., 1234). And paste your action link here once it has been successfully finished.
Application test
Document test
...

5. New dependencies

New Python dependencies
- Dependency1
- Dependency2
- ...
New Java/Scala dependencies and their license
- Dependency1 and license1
- Dependency2 and license2
- ...

small bugfix

rnwang04 · 2024-11-01T10:19:34Z

python/llm/dev/benchmark/all-in-one/run.py

@@ -2193,6 +2198,8 @@ def run_pipeline_parallel_gpu(repo_id,
        optimize_model = conf['optimize_model']
    if 'group_size' in conf:
        group_size = conf['group_size']


remove above if.

rnwang04

others LGTM.

Oscilloscope98 · 2024-11-01T10:30:29Z

python/llm/dev/benchmark/all-in-one/run.py

@@ -214,7 +214,8 @@ def run_model(repo_id, test_api, in_out_pairs, local_model_hub=None, warm_up=1,
                            round(result[in_out_pair][-1][5], 2),
                            result[in_out_pair][-1][6] if any(keyword in test_api for keyword in ['int4_gpu', 'int4_fp16_gpu_win', 'int4_loadlowbit_gpu', 'int4_fp16_loadlowbit_gpu', 'fp16_gpu', 'deepspeed_optimize_model_gpu']) and not lookahead else 'N/A',
                            streaming if 'win' in test_api else 'N/A',
-                            use_fp16_torch_dtype if 'pipeline_parallel_gpu' in test_api else 'N/A'],
+                            use_fp16_torch_dtype if 'pipeline_parallel_gpu' in test_api else 'N/A',
+                            group_size],


Should we make group_size value "N/A" for other test_api? Maybe confusing

Good point, we will fix this in next PR . 😊

add npu_group_size for transformers_int4_npu_win

b6c751e

small bugfix

rnwang04 reviewed Nov 1, 2024

View reviewed changes

rnwang04 requested review from Oscilloscope98 and cyita November 1, 2024 10:20

rnwang04 approved these changes Nov 1, 2024

View reviewed changes

rnwang04 changed the title ~~add npu_group_size for transformers_int4_npu_win~~ add npu_group_size for transformers_int4_npu_win in all-in-one benchmark api Nov 1, 2024

update

e805385

cyita approved these changes Nov 1, 2024

View reviewed changes

Oscilloscope98 reviewed Nov 1, 2024

View reviewed changes

rnwang04 merged commit 48123af into intel-analytics:main Nov 1, 2024
1 check passed

ch1y0q deleted the npu_group_size branch November 4, 2024 02:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add `npu_group_size` for `transformers_int4_npu_win` in all-in-one benchmark api #12316

add `npu_group_size` for `transformers_int4_npu_win` in all-in-one benchmark api #12316

ch1y0q commented Nov 1, 2024

rnwang04 Nov 1, 2024

rnwang04 left a comment

Oscilloscope98 Nov 1, 2024

rnwang04 Nov 1, 2024

add npu_group_size for transformers_int4_npu_win in all-in-one benchmark api #12316

add npu_group_size for transformers_int4_npu_win in all-in-one benchmark api #12316

Conversation

ch1y0q commented Nov 1, 2024

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

5. New dependencies

rnwang04 Nov 1, 2024

Choose a reason for hiding this comment

rnwang04 left a comment

Choose a reason for hiding this comment

Oscilloscope98 Nov 1, 2024

Choose a reason for hiding this comment

rnwang04 Nov 1, 2024

Choose a reason for hiding this comment

add `npu_group_size` for `transformers_int4_npu_win` in all-in-one benchmark api #12316

add `npu_group_size` for `transformers_int4_npu_win` in all-in-one benchmark api #12316