
Added codellama #2146

Closed
wants to merge 3 commits into from

Conversation

MaanavD
Contributor

@MaanavD MaanavD commented Jan 30, 2024

Adding the codellama model to canary.
[image attached]
(doesn't run on a 16GB GPU)

@xuzhao9
Contributor

xuzhao9 commented Jan 30, 2024

I am good with adding it to canary.
Just asking, does it run on a single 40GB A100?

@MaanavD
Contributor Author

MaanavD commented Jan 31, 2024

@xuzhao9 it runs on A100 :)

$ python run.py codellama -d cuda
Warning: The model codellama cannot be found at core set.
/workspace/bowbao/onnxbench/transformers/src/transformers/utils/hub.py:124: FutureWarning: Using TRANSFORMERS_CACHE is deprecated and will be removed in v5 of Transformers. Use HF_HOME instead.
warnings.warn(
config.json: 100%| 637/637 [00:00<00:00, 3.35MB/s]
Running eval method from codellama on cuda in eager mode with input batch size 1 and precision fp16.
GPU Time per batch: 45.310 milliseconds
CPU Wall Time per batch: 45.340 milliseconds
Time to first batch: 75980.8640 ms
GPU 0 Peak Memory: 18.8482 GB
CPU Peak Memory: 1.4941 GB
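The 18.8 GB peak is consistent with back-of-the-envelope fp16 sizing for a 7B-parameter model (an assumption; the log does not state which CodeLlama variant was run): the weights alone need roughly 14 GB, with the KV cache and activations accounting for the rest. That explains why a 16 GB GPU OOMs while a 40 GB A100 has plenty of headroom. A quick arithmetic sketch:

```python
# Rough fp16 memory estimate (hypothetical sizing: the PR does not say
# which CodeLlama variant was benchmarked; 7B parameters is assumed).
PARAMS = 7e9          # assumed parameter count
BYTES_PER_PARAM = 2   # fp16 = 2 bytes per parameter

weights_gb = PARAMS * BYTES_PER_PARAM / 1e9
print(f"fp16 weights: {weights_gb:.1f} GB")  # fp16 weights: 14.0 GB
# Observed peak was 18.8 GB = weights + KV cache + activations,
# so a 16 GB card OOMs while a 40 GB A100 fits comfortably.
```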

@xuzhao9
Contributor

xuzhao9 commented Jan 31, 2024

@MaanavD Nice! In that case, I suggest we add this model to models/ instead of canary. We should disable the CPU test because it is too slow and will time out. On A10G it should not OOM, since peak GPU memory is 18.8 GB and the A10G has 24 GB, but we can also disable it there if it does OOM.
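In torchbenchmark, per-model constraints like "skip the CPU test" are typically declared in the model's `metadata.yaml`. A sketch of what that could look like for this model (the field names and syntax are my assumption, modeled on other models in the repo, not taken from this PR):

```yaml
# torchbenchmark/models/codellama/metadata.yaml (hypothetical sketch)
eval_benchmark: false
eval_deterministic: false
eval_nograd: true
train_benchmark: false
train_deterministic: false
not_implemented:
  # Skip CPU eval: too slow, would time out in CI
  # (assumed syntax, based on other models' metadata files)
  - device: cpu
```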

@xuzhao9
Contributor

xuzhao9 commented Feb 23, 2024

LGTM

@facebook-github-bot
Contributor

@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@xuzhao9 merged this pull request in 4386604.
