no code examples (#429)
* wip

* update

* done
RR4787 authored Aug 17, 2023
1 parent b7b86ad commit 29e298c
Showing 4 changed files with 45 additions and 14 deletions.
4 changes: 3 additions & 1 deletion examples/inference-deployments/mpt/README.md
@@ -7,9 +7,11 @@ Check out [this blog post](https://www.mosaicml.com/blog/mpt-7b) for more inform
 You’ll find in this folder:

 - Model YAMLS - read [docs](https://docs.mosaicml.com/projects/mcli/en/latest/inference/inference_schema.html) for an explanation of each field.
+  - `mpt_7b.yaml` - an optimized no-code yaml to deploy [MPT-7B Base](https://huggingface.co/mosaicml/mpt-7b).
+  - `mpt_30b.yaml` - an optimized no-code yaml to deploy [MPT-30B Base](https://huggingface.co/mosaicml/mpt-30b).
   - `mpt_30b_ft.yaml` - a yaml to deploy [MPT-30B Base](https://huggingface.co/mosaicml/mpt-30b).
   - `mpt_30b_instruct_ft.yaml` - a yaml to deploy [MPT-30B Instruct](https://huggingface.co/mosaicml/mpt-30b-instruct).
-  - `mpt_7b.yaml` - a yaml to deploy [MPT-7B Base](https://huggingface.co/mosaicml/mpt-7b).
+  - `mpt_7b_custom.yaml` - a custom yaml to deploy [MPT-7B Base](https://huggingface.co/mosaicml/mpt-7b).
   - `mpt_7b_instruct.yaml` - a yaml to deploy [MPT-7B Instruct](https://huggingface.co/mosaicml/mpt-7b-instruct).
   - `mpt_7b_storywriter.yaml` - a yaml to deploy [MPT-7B StoryWriter](https://huggingface.co/mosaicml/mpt-7b-storywriter).
 - Model handlers - these define how your model should be loaded and how the model should be run when receiving a request. You can use the default handlers here or write your custom model handler as per instructions [here](https://docs.mosaicml.com/projects/mcli/en/latest/inference/deployment_features.html#custom-model-handlers).
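
Reviewer note: the README says a handler defines how the model is loaded and how it is run on each request. The sketch below shows only that two-step shape. The class name `EchoModelHandler`, the `predict` method, and the request key `"input"` are hypothetical and are not taken from the MosaicML handler interface; the "model" is a stand-in function so the example runs without any dependencies.

```python
from typing import Any, Callable, Dict, List


class EchoModelHandler:
    """Hypothetical handler; names and signatures are assumptions for illustration."""

    def __init__(self, model_name: str) -> None:
        # Load step: a real handler would fetch and initialize model weights here.
        self.model_name = model_name
        self.model: Callable[[str], str] = self._load(model_name)

    def _load(self, model_name: str) -> Callable[[str], str]:
        # Stand-in "model" that upper-cases the prompt instead of generating text.
        return lambda prompt: prompt.upper()

    def predict(self, requests: List[Dict[str, Any]]) -> List[str]:
        # Run step: apply the loaded model to each incoming request.
        return [self.model(request["input"]) for request in requests]


handler = EchoModelHandler(model_name="mosaicml/mpt-7b")
outputs = handler.predict([{"input": "hello"}, {"input": "mpt"}])
```

In `mpt_7b_custom.yaml`, `model_parameters` carries a `model_name` value; passing it to the constructor as above is an assumption about how such parameters might reach a handler.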
17 changes: 17 additions & 0 deletions examples/inference-deployments/mpt/mpt_30b.yaml
@@ -0,0 +1,17 @@
+name: mpt-30b-simple
+compute:
+  gpus: 2
+  gpu_type: a100_40gb
+image: mosaicml/inference:0.1.16
+replicas: 1
+command: |
+  export PYTHONPATH=/code/llm-foundry:/code
+integrations:
+  - integration_type: git_repo
+    git_repo: mosaicml/llm-foundry
+    git_commit: 496b50bd588b1a7231fe54b05d70babb3620fc72
+    ssh_clone: false
+default_model:
+  model_type: mpt-30b
+  checkpoint_path:
+    hf_path: mosaicml/mpt-30b
19 changes: 6 additions & 13 deletions examples/inference-deployments/mpt/mpt_7b.yaml
@@ -1,19 +1,12 @@
-name: mpt-7b
+name: mpt-7b-simple
 compute:
   gpus: 1
   instance: oci.vm.gpu.a10.1
 image: mosaicml/inference:0.1.16
-replicas: 1
 command: |
-  export PYTHONPATH=$PYTHONPATH:/code/examples
-integrations:
-  - integration_type: git_repo
-    git_repo: mosaicml/examples
-    ssh_clone: false
-    git_commit: df65ce9448f2e4c7803f7082930f80c8dc4e8fe1
-model:
-  download_parameters:
+  export PYTHONPATH=/code
+replicas: 1
+default_model:
+  model_type: mpt-7b
+  checkpoint_path:
     hf_path: mosaicml/mpt-7b
-  model_handler: examples.inference-deployments.mpt.mpt_handler.MPTModelHandler
-  model_parameters:
-    model_name: mosaicml/mpt-7b
19 changes: 19 additions & 0 deletions examples/inference-deployments/mpt/mpt_7b_custom.yaml
@@ -0,0 +1,19 @@
+name: mpt-7b
+compute:
+  gpus: 1
+  instance: oci.vm.gpu.a10.1
+image: mosaicml/inference:0.1.16
+replicas: 1
+command: |
+  export PYTHONPATH=$PYTHONPATH:/code/examples
+integrations:
+  - integration_type: git_repo
+    git_repo: mosaicml/examples
+    ssh_clone: false
+    git_commit: df65ce9448f2e4c7803f7082930f80c8dc4e8fe1
+model:
+  download_parameters:
+    hf_path: mosaicml/mpt-7b
+  model_handler: examples.inference-deployments.mpt.mpt_handler.MPTModelHandler
+  model_parameters:
+    model_name: mosaicml/mpt-7b
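
Reviewer note: this commit leaves two config shapes side by side. The no-code YAMLs use a top-level `default_model` block, while `mpt_7b_custom.yaml` keeps a `model` block with an explicit `model_handler`. The sketch below tells the two apart on already-parsed configs. The helper `deployment_style` and its return labels are hypothetical, and the dictionaries are hand-written to mirror the nesting assumed for the YAML in this diff rather than loaded from the files.

```python
from typing import Any, Dict


def deployment_style(config: Dict[str, Any]) -> str:
    """Classify a parsed deployment config by which model block it uses."""
    if "default_model" in config:
        # No-code style: the platform supplies the handler for a known model type.
        return "no-code"
    if "model_handler" in config.get("model", {}):
        # Custom style: the config names the handler class to import.
        return "custom-handler"
    return "unknown"


# Mirrors the model section of the new mpt_7b.yaml (assumed nesting).
no_code_config = {
    "name": "mpt-7b-simple",
    "default_model": {
        "model_type": "mpt-7b",
        "checkpoint_path": {"hf_path": "mosaicml/mpt-7b"},
    },
}

# Mirrors the model section of mpt_7b_custom.yaml (assumed nesting).
custom_config = {
    "name": "mpt-7b",
    "model": {
        "download_parameters": {"hf_path": "mosaicml/mpt-7b"},
        "model_handler": "examples.inference-deployments.mpt.mpt_handler.MPTModelHandler",
        "model_parameters": {"model_name": "mosaicml/mpt-7b"},
    },
}

styles = (deployment_style(no_code_config), deployment_style(custom_config))
```

The no-code variant also drops the `integrations` entry for `mosaicml/examples` and the `/code/examples` path from `PYTHONPATH`, since no handler module needs to be imported from that repository.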
