feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release #5104

vtaskow · 2023-08-23T10:34:15Z

What this PR does / why we need it:
The next version of MLServer will be allowing users to create custom HuggingFace models. Users will be able to specify a location in which model artefacts will be located and later used. This will allow users to customize certain model parameters, such as tokenizers, feature extractors, image processors, etc and then build a HuggingFace model and use it, not limiting themselves to existing models on the official hub.

Automating the creation and the upload of a simple custom HuggingFace model to our collection of models in our GCS bucket - seldon-models, will pave the way to use said model in upcoming demos, showcasing how users can actually load a custom model and use it in SCv2 or in our paid products.

By having this automation, we ensure that on every new MLServer version release, the custom HF model is added to our collection of models in the bucket and demo pages are kept up-to-date.

By having this change separated from a similar SCv2-related change, I am ensuring that SCv1 related demo pages are using models uploaded in non-SCv2 folders in GCS. This will avoid confusion as to why a SCv1-related demo is using a model uploaded to gs://seldon-models/scv2/... folder.

Testing

The resulted model was generated correctly by running make env && make train
The generated model was uploaded correctly to a public GCS bucket(mine in this case) by running make upload
The uploaded model was used correctly by creating a K8s SeldonDeployment resource and was able to correctly make a prediction without any additional downloads

apiVersion: machinelearning.seldon.io/v1alpha2
kind: SeldonDeployment
metadata:
  name: huggingface-model
spec:
  protocol: v2
  predictors:
  - graph:
      name: transformer
      implementation: HUGGINGFACE_SERVER
      modelUri: gs://viktor-models/v1.18.0-dev/huggingface/text-generation
      parameters:
      - name: task
        type: STRING
        value: text-generation
    componentSpecs:
      - spec:
          containers:
            - name: transformer
              resources:
                limits:
                  cpu: 1
                  memory: 4Gi
                requests:
                  cpu: 100m
                  memory: 3Gi
    name: default
    replicas: 1

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

review-notebook-app · 2023-08-23T13:01:13Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

adriangonz

Nice one @vtaskow! Looks good to me.

…to seldon-models in GCS on every version release (SeldonIO#5104) * V2D-1258 Create and upload a custom HF model to seldon-models in GCS * Remove left over link for kfserving storage initialiser * Change GCS location to make it obvious it is a custom hf model

V2D-1258 Create and upload a custom HF model to seldon-models in GCS

70a7714

vtaskow requested review from RafalSkolasinski, adriangonz, ukclivecox and agrski August 23, 2023 11:15

vtaskow marked this pull request as ready for review August 23, 2023 11:19

vtaskow changed the title ~~feat(docs): Create and upload a custom HF model to seldon-models in GCS for SCv1~~ feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release Aug 23, 2023

Remove left over link for kfserving storage initialiser

72f2a22

Change GCS location to make it obvious it is a custom hf model

4f09070

vtaskow mentioned this pull request Aug 23, 2023

feat(docs): [SCv1] Add a section about loading custom HuggingFace models #5105

Merged

2 tasks

vtaskow self-assigned this Aug 23, 2023

adriangonz approved these changes Aug 24, 2023

View reviewed changes

vtaskow merged commit 7c2d4f1 into SeldonIO:master Aug 24, 2023
11 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release #5104

feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release #5104

vtaskow commented Aug 23, 2023 •

edited

Loading

review-notebook-app bot commented Aug 23, 2023

adriangonz left a comment

feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release #5104

feat(docs): [SCv1] Automatically create and upload a custom HF model to seldon-models in GCS on every version release #5104

Conversation

vtaskow commented Aug 23, 2023 • edited Loading

review-notebook-app bot commented Aug 23, 2023

adriangonz left a comment

Choose a reason for hiding this comment

vtaskow commented Aug 23, 2023 •

edited

Loading