Fix vision model graph capture not creating static buffers for embedding #942
This change essentially reverses the ownership of the embeddings memory. Instead of creating the embeddings tensor in the embedding model and pointing the text model's embeddings input at it, we now create the embeddings tensor inside the text model and point the embedding model's embeddings output at it.
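A minimal sketch of the ownership reversal, using hypothetical types and names rather than the actual onnxruntime-genai classes: the text model now allocates the embeddings buffer once, and the embedding (vision) model binds its output to that buffer instead of allocating its own.

```cpp
// Hypothetical illustration of the new ownership direction; not the real API.
#include <memory>
#include <vector>

struct Tensor {
  std::vector<float> data;  // backing storage for the embeddings
};

struct TextModel {
  // Allocated by the text model. With graph capture enabled this acts as a
  // static buffer that must stay valid across iterations and generators.
  std::shared_ptr<Tensor> embeddings = std::make_shared<Tensor>();
};

struct EmbeddingModel {
  // No longer allocates its own output tensor; it aliases the text model's
  // buffer so the captured graph always sees live memory.
  std::shared_ptr<Tensor> output;

  void BindOutputTo(TextModel& text_model) { output = text_model.embeddings; }
};
```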
The reason for this is that the text model may be running in "graph capture mode", which means it allocates static buffers that are reused between iterations, and even between generators. If we allocate the memory in the embedding model and point the text model to it, that memory becomes invalid when the generator is destroyed, and the captured graph exhibits undefined behavior (mostly producing garbage output). By pointing the embeddings output of the embedding model at the static buffer created by the text model, we can be certain the memory stays alive for the lifetime of the model.
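For contrast, a hypothetical sketch of the old failure mode: a captured graph effectively bakes in the address of the embeddings buffer at capture time, so if that buffer is owned by an object that dies with the generator, later replays read freed memory. The type and method names here are illustrative only.

```cpp
// Hypothetical illustration of why per-generator ownership breaks graph capture.
struct CapturedGraph {
  const float* embeddings_ptr = nullptr;  // address recorded at capture time

  void Replay() const {
    // Reads through embeddings_ptr on every replay. If the buffer was owned by
    // the embedding model and freed when the generator was destroyed, this is
    // a dangling read: undefined behavior, typically garbage output.
  }
};
```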
This PR doesn't change the behavior of the non-graph-capture mode, since in that scenario it does not matter whether the tensor is created by the embedding model or the text model, but it fixes graph capture usage for vision models.