
Add usage snippets for Google Health AI models #1084

Open. Wants to merge 12 commits into base: main.
Conversation

@ndebuhr (Contributor) commented Dec 24, 2024

Adding usage snippets to improve Hugging Face Hub model pages for CXR Foundation and Derm Foundation - improving usability via example Python code and linked Jupyter notebooks. Doing this at the model library level, to complement the usage documentation in the model cards, so that the "Use this model" Hub button provides useful information (currently "integration status unknown").

@Vaibhavs10 (Member) left a comment

Thanks for the PR @ndebuhr - re: both snippets:

  1. Both should actually be code snippets (we discourage putting URLs to notebooks, etc.).
  2. For the other code snippet, we try to include the fewest lines of code that someone can understand and work with, so it would be great to reduce it.

Otherwise, I think this information can fit nicely in the Model Card.

@ndebuhr (Contributor, Author) commented Jan 2, 2025

@Vaibhavs10 I can collapse some lines or remove some comments to further reduce the LOC, but I think that would impact readability (especially in the relatively low-width snippet modal). Hoping this second pass is better?

@ndebuhr requested a review from @Vaibhavs10 on January 2, 2025.
@Vaibhavs10 (Member) left a comment

Thanks a lot for iterating here - some last nits and we're good to merge!

@@ -95,6 +95,26 @@ export const bm25s = (model: ModelData): string[] => [
retriever = BM25HF.load_from_hub("${model.id}")`,
];

export const cxr_foundation = (model: ModelData): string[] => [
`# Install library
A Member commented:

I think we can remove the install instructions completely from here and keep them in the model card (as they are currently), and just start from "from PIL import Image".

@ndebuhr (Contributor, Author) replied:

Since we aren't doing a "typical" installation (e.g., pip install from PyPI or git), I'm a bit concerned about pulling that information out of the snippet. I think people will make assumptions that aren't true. Basically, there's no way somebody is going to correctly use the library or guess the installation without this, so I'd really prefer we keep that critical information in the snippet (as it looks like some other libraries have done). Is that fair?

@Wauplin (Contributor) commented Jan 14, 2025

@ndebuhr where did you get these installation steps? I would be more inclined to add a single comment line that redirects users to the installation steps. Something like this:

# Install from https://github.com/Google-Health/cxr-foundation

This is what is already done for quite a few libraries, and it is the direction we want to take for new ones. The problem with putting an installation guide in the code snippet is that it makes the snippet much longer and much harder to maintain (installation guides can be tricky). In this case, the best thing would be to have those few lines in the cxr-foundation README directly.

(Three resolved review comments on packages/tasks/src/model-libraries-snippets.ts, now outdated.)
@Wauplin (Contributor) left a comment

Thanks for the PR! Agree with @Vaibhavs10 on trying to get more concise examples (though still readable).

Comment on lines +112 to +115
cxr_client = make_hugging_face_client('cxr_model')
!wget -nc -q https://upload.wikimedia.org/wikipedia/commons/c/c8/Chest_Xray_PA_3-8-2010.png

print(cxr_client.get_image_embeddings_from_images([Image.open("Chest_Xray_PA_3-8-2010.png")]))`,
A Contributor commented:

Interlacing Python code with command lines feels odd. Could you switch to a fully Python snippet, using for instance requests.get("https://upload.wikimedia.org/wikipedia/commons/c/c8/Chest_Xray_PA_3-8-2010.png")? (as done below)

Reply:

Here is a pure-Python solution. Alternatively, we could require the user to provide a local image.

import requests
from io import BytesIO
from PIL import Image

# Image attribution: Stillwaterising, CC0, via Wikimedia Commons
image_url = "https://upload.wikimedia.org/wikipedia/commons/c/c8/Chest_Xray_PA_3-8-2010.png"
response = requests.get(image_url, headers={'User-Agent': 'Demo'}, stream=True)
response.raw.decode_content = True  # Ensure correct decoding
img = Image.open(BytesIO(response.content)).convert('L')  # Convert to grayscale

I tested the following complete snippet:

!git clone https://github.com/Google-Health/cxr-foundation.git
import tensorflow as tf, sys, requests
sys.path.append('cxr-foundation/python/')

# Install dependencies
major_version = tf.__version__.rsplit(".", 1)[0]
!pip install tensorflow-text=={major_version} pypng && pip install --no-deps pydicom hcls_imaging_ml_toolkit retrying

# Load image (Stillwaterising, CC0, via Wikimedia Commons)
from PIL import Image
from io import BytesIO
image_url = "https://upload.wikimedia.org/wikipedia/commons/c/c8/Chest_Xray_PA_3-8-2010.png"
response = requests.get(image_url, headers={'User-Agent': 'Demo'}, stream=True)
response.raw.decode_content = True # Ensure correct decoding
img = Image.open(BytesIO(response.content)).convert('L') # Convert to grayscale

# Run inference
from clientside.clients import make_hugging_face_client
cxr_client = make_hugging_face_client('cxr_model')
print(cxr_client.get_image_embeddings_from_images([img]))
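
As a side note, the BytesIO round trip and grayscale conversion used above can be sanity-checked locally without the network download. A minimal sketch with a synthetic Pillow image (the 4x4 image is a hypothetical stand-in for the downloaded X-ray; assumes Pillow is installed):

```python
from io import BytesIO
from PIL import Image

# Build a tiny synthetic RGB image in memory (stand-in for the downloaded X-ray)
rgb = Image.new("RGB", (4, 4), color=(120, 30, 200))

# Serialize to PNG bytes, mirroring what requests.get(...).content provides
buf = BytesIO()
rgb.save(buf, "PNG")
png_bytes = buf.getvalue()

# Re-open from raw bytes and convert to grayscale, as in the snippet above
img = Image.open(BytesIO(png_bytes)).convert("L")
print(img.mode, img.size)  # L (4, 4)
```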

A Contributor commented:

I tested it locally, and this simplified snippet seems to work just as well for loading the image. Could you replace it?

# Load image
from PIL import Image
from io import BytesIO
import requests
response = requests.get("https://upload.wikimedia.org/wikipedia/commons/c/c8/Chest_Xray_PA_3-8-2010.png")
img = Image.open(BytesIO(response.content)).convert('L') # Convert to grayscale

@Wauplin (Contributor) commented Jan 15, 2025

Also, what I don't like with

!git clone https://github.com/Google-Health/cxr-foundation.git
import tensorflow as tf, sys, requests
sys.path.append('cxr-foundation/python/')

# Install dependencies
major_version = tf.__version__.rsplit(".", 1)[0]
!pip install tensorflow-text=={major_version} pypng && pip install --no-deps pydicom hcls_imaging_ml_toolkit retrying

is that it mixes command lines and Python code. This is OK-ish in a notebook environment, but it is not a valid Python snippet. I would really prefer that these installation instructions be added to the GitHub repo or a cxr-foundation wiki directly. That way, each line can be explained to the user, which we don't do here. Since the installation process is complex, I do think it's important to explain it in a proper markdown document rather than in an autogenerated code snippet on the Hugging Face Hub.
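
On the shell-vs-Python point: if the clone and install commands ever did need to live inside a plain Python file, they could be expressed with subprocess instead of notebook-style "!" lines. A hedged sketch (commands taken from the snippet above; the actual run calls are commented out so the block has no side effects, and the thread's conclusion is still to move these steps into the repo docs):

```python
import subprocess
import sys

# The notebook-style "!" lines expressed as plain Python command lists
clone_cmd = ["git", "clone", "https://github.com/Google-Health/cxr-foundation.git"]
install_cmd = [sys.executable, "-m", "pip", "install", "--no-deps",
               "pydicom", "hcls_imaging_ml_toolkit", "retrying"]

# Uncomment to actually execute:
# subprocess.run(clone_cmd, check=True)
# subprocess.run(install_cmd, check=True)
print(" ".join(clone_cmd))
```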

(Two resolved review comments on packages/tasks/src/model-libraries-snippets.ts, now outdated.)
Comment on lines +206 to +208
buf = BytesIO()
image.convert("RGB").save(buf, "PNG")
image_bytes = buf.getvalue()
A Contributor commented:

Is this step mandatory for the provided example? https://storage.googleapis.com/dx-scin-public-data/dataset/images/3445096909671059178.png seems to already be a PNG file with RGB colors.

Reply:

Good point! Below is a more condensed snippet:

from huggingface_hub import from_pretrained_keras
import tensorflow as tf, requests

# Load and format input
IMAGE_URL = "https://storage.googleapis.com/dx-scin-public-data/dataset/images/3445096909671059178.png"
input_tensor = tf.train.Example(
    features=tf.train.Features(
        feature={
            "image/encoded": tf.train.Feature(
                bytes_list=tf.train.BytesList(value=[requests.get(IMAGE_URL, stream=True).content])
            )
        }
    )
).SerializeToString()

# Load model and run inference
loaded_model = from_pretrained_keras("google/derm-foundation")
infer = loaded_model.signatures["serving_default"]
print(infer(inputs=tf.constant([input_tensor])))

@wzwz1 left a comment

I've incorporated the feedback and rewritten the snippets and tested them.

@ndebuhr (Contributor, Author) commented Jan 15, 2025

I'll cement these discussions into commits shortly. Thanks, team.

4 participants