
Add more examples for pipeline parallel inference #11372

Merged: 7 commits merged on Jun 21, 2024

Conversation

@sgwhat (Contributor) commented Jun 20, 2024

Description

This PR adds pipeline parallel inference examples for models that have been evaluated.

How to test?

  • Local test
  • Unit test

@sgwhat sgwhat requested a review from plusbang June 21, 2024 09:05
@@ -0,0 +1,25 @@
source /opt/intel/oneapi/setvars.sh
Review comment (Contributor):
Please add license for each script.


# To run CodeLlama-13b-Instruct-hf
# CCL_ZE_IPC_EXCHANGE=sockets torchrun --standalone --nnodes=1 --nproc-per-node $NUM_GPUS \
# generate.py --repo-id-or-model-path 'codellama/CodeLlama-7b-Instruct-hf' --gpu-num $NUM_GPUS
Review comment (Contributor):
'codellama/CodeLlama-7b-Instruct-hf' -> 'codellama/CodeLlama-13b-Instruct-hf'
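
With the reviewer's correction applied, the launch script would look like the following. This is a sketch based on the snippet in the diff; it assumes `$NUM_GPUS` is set, the oneAPI environment is sourced as shown earlier in the script, and `generate.py` accepts the `--repo-id-or-model-path` and `--gpu-num` arguments shown in the diff:

```shell
# Initialize the Intel oneAPI environment (as in the script under review)
source /opt/intel/oneapi/setvars.sh

# Number of GPUs to shard the model across for pipeline parallel inference
export NUM_GPUS=2

# To run CodeLlama-13b-Instruct-hf, with the model path corrected per the review
CCL_ZE_IPC_EXCHANGE=sockets torchrun --standalone --nnodes=1 --nproc-per-node $NUM_GPUS \
  generate.py --repo-id-or-model-path 'codellama/CodeLlama-13b-Instruct-hf' --gpu-num $NUM_GPUS
```

`CCL_ZE_IPC_EXCHANGE=sockets` selects the IPC exchange mechanism for oneCCL on Level Zero devices; the command requires Intel GPU hardware and the matching runtime, so it is shown here only as an illustration of the corrected invocation.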

@sgwhat sgwhat merged commit 0c67639 into intel-analytics:main Jun 21, 2024
30 of 31 checks passed
RyuKosei pushed a commit to RyuKosei/ipex-llm that referenced this pull request on Jul 19, 2024

* add more model examples for pipeline parallel inference

* add mixtral and vicuna models

* add yi model and past_kv support for chatglm family

* add docs

* doc update

* add license

* update
2 participants