Text Embeddings Inference update #419

joaomsimoes · 2024-09-11T16:16:29Z

Hi everyone.

I wanted to use Text Embeddings Inference (TEI) with the encoder, but I noticed two small bugs in the code. I believe HFEndpointEncoder was intentionally created to be used with TEI (right?)

The retry loop over max_retries inside the query function has no break, or any other way to return the result, when a request succeeds. I added a break, similar to the OpenAI encoder.
The response from TEI is [[[array]]]: the array is nested inside a list of lists. I remove one level of nesting when receiving the response. Without this, a dimension error is thrown when comparing the vectors.
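
The two fixes above can be sketched roughly as follows. This is a minimal illustration, not the actual HFEndpointEncoder code: `query_with_retries`, `unwrap_embeddings`, and the `send` callable are hypothetical names, and the `[[[array]]]` response shape is assumed from the description above.

```python
import time
from typing import Any, Callable, List


def query_with_retries(
    send: Callable[[], Any],
    max_retries: int = 3,
    delay: float = 0.0,
) -> Any:
    """Call `send` until it succeeds and return the first successful result."""
    last_error = None
    for _ in range(max_retries):
        try:
            # Returning here leaves the loop as soon as one attempt succeeds.
            return send()
        except Exception as exc:  # assumption: real code would catch narrower errors
            last_error = exc
            time.sleep(delay)
    raise RuntimeError(f"query failed after {max_retries} attempts") from last_error


def unwrap_embeddings(response: List[Any]) -> List[List[float]]:
    """Drop one level of nesting when the response is [[[...]]] rather than [[...]]."""
    if (
        len(response) == 1
        and isinstance(response[0], list)
        and response[0]
        and isinstance(response[0][0], list)
    ):
        return response[0]
    return response
```

Here `send` stands in for whatever performs the HTTP request. Checking the shape before unwrapping keeps the helper safe if the endpoint already returns a flat list of vectors.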

These are the main bugs, but I would also like to propose a future update. With TEI we can send a batch of texts:

curl 127.0.0.1:8080/embed \
    -X POST \
    -d '{"inputs":["Today is a nice day", "I like you"]}' \
    -H 'Content-Type: application/json'

To save time, we could send the sentences to the endpoint in batches. This would be great for longer documents. If it sounds interesting, I can try to help develop it.

By the way, should I use semantic router for splitting text, or the semantic chunkers?

joaomsimoes · 2024-09-12T11:48:05Z

Okay, I added batching to the encoder. Bug number 2 is now solved.

Batching the requests is also much faster. Before, it took around a minute for a small document; now it takes a few seconds. I picked an arbitrary batch size; this could be tuned.
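
For reference, the batching idea looks roughly like this. It is a sketch under assumptions, not the code in the commits: `batched`, `embed_in_batches`, and the `embed_batch` callable are hypothetical names, and the default of 32 only mirrors the "batch 32" commit message.

```python
from typing import Callable, Iterator, List


def batched(items: List[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def embed_in_batches(
    texts: List[str],
    embed_batch: Callable[[List[str]], List[List[float]]],
    batch_size: int = 32,  # assumption: arbitrary value, not a tuned one
) -> List[List[float]]:
    """Embed `texts` one batch at a time, preserving the input order."""
    embeddings: List[List[float]] = []
    for batch in batched(texts, batch_size):
        # One request per batch instead of one request per sentence.
        embeddings.extend(embed_batch(batch))
    return embeddings
```

In practice `embed_batch` would POST `{"inputs": batch}` to the `/embed` route, as in the curl example earlier in this thread, and return one vector per input text.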

jamescalam · 2024-09-21T11:58:17Z

@joaomsimoes DM'd you on discord but not sure if you still use it - we continued your PR here, can you take a look and let us know if all is correct? Thanks! #423

joaomsimoes added 3 commits September 11, 2024 18:58

Text Embeddings Inference update

8198e13

batch Text Embeddings Inference

bfce487

batch Text Embeddings Inference

3328286

jamescalam assigned ashraq1455 Sep 16, 2024

jamescalam assigned Vits-99 and unassigned ashraq1455 Sep 21, 2024

jamescalam added the bug (Something isn't working) and enhancement (Enhancement to existing features) labels Sep 21, 2024

joaomsimoes added 2 commits September 24, 2024 11:51

Merge branch 'aurelio-labs:main' into main

3ee04bc

batch 32

09077e2

joaomsimoes closed this by deleting the head repository Sep 24, 2024
