
Improve redis semantic cache implementation #5412

Conversation

@tylerhutcherson commented Aug 28, 2024

Improve Redis semantic caching

Relevant issues

Type

🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation

Changes

  • Use latest version of RedisVL (0.3.2) and fix pydantic & schema version issues
  • Support isolated/customized index names to allow for multitenancy
  • Support arbitrary embedding model in index schema provided through litellm.embeddings module
  • Use built-in SemanticCache extension for cleaner code and processing
  • Add TTL support to the Redis semantic cache (an out-of-the-box Redis feature rather than a tack-on)
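To make the mechanism concrete: a semantic cache retrieves a stored response when a new prompt's embedding is close enough to a cached one, rather than requiring an exact key match. The following is a hypothetical, in-memory sketch of that idea (distance threshold, pluggable embedding function, per-entry TTL) — it is not the actual RedisVL `SemanticCache` or LiteLLM implementation, and all names here are illustrative:

```python
import math
import time

class ToySemanticCache:
    """Illustrative in-memory semantic cache: nearest-neighbor lookup over
    prompt embeddings, with a cosine-distance threshold and per-entry TTL."""

    def __init__(self, embed_fn, distance_threshold=0.1, ttl=None):
        self.embed_fn = embed_fn              # pluggable embedding model
        self.distance_threshold = distance_threshold
        self.ttl = ttl                        # seconds; None means no expiry
        self._entries = []                    # (embedding, response, stored_at)

    @staticmethod
    def _cosine_distance(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return 1.0 - dot / norm

    def store(self, prompt, response):
        self._entries.append((self.embed_fn(prompt), response, time.time()))

    def check(self, prompt):
        """Return the closest non-expired cached response within the threshold."""
        query = self.embed_fn(prompt)
        now = time.time()
        best = None
        for emb, response, stored_at in self._entries:
            if self.ttl is not None and now - stored_at > self.ttl:
                continue  # entry expired; Redis would evict this natively via TTL
            dist = self._cosine_distance(query, emb)
            if dist <= self.distance_threshold and (best is None or dist < best[0]):
                best = (dist, response)
        return best[1] if best else None
```

In the real implementation the entries live in a Redis vector index and the TTL is enforced by Redis itself, which is why handling it natively (rather than as a tack-on) simplifies the cache code.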


vercel bot commented Aug 28, 2024

The latest updates on your projects:

litellm: ✅ Ready (preview deployed; updated Sep 11, 2024, 0:26am UTC)


@rbs333 left a comment


Generally LGTM had a question about the todo

litellm/caching.py: review thread (outdated, resolved)
@ishaan-jaff ishaan-jaff changed the base branch from main to litellm_stable_branch_staging August 29, 2024 01:14
@ishaan-jaff (Contributor) left a comment

Is this missing async support @tylerhutcherson?

If yes, can you add async support in this PR? The majority of users are trying to use this through our LLM gateway, which requires all functions to be async.

@tylerhutcherson (Author)

Is this missing async support @tylerhutcherson?

If yes, can you add async support in this PR? The majority of users are trying to use this through our LLM gateway, which requires all functions to be async.

Hey @ishaan-jaff @krrishdholakia see my comment here:
#5412 (comment)

Let me know what you think!

@ishaan-jaff (Contributor)

@tylerhutcherson

Async support is a hard requirement for LiteLLM. The majority of our users use async; if we merge this and users run it in prod, their LiteLLM service will go down. We have seen this happen before when a non-async function was used.

We're happy to wait on this until redisvl adds async support.

@tylerhutcherson (Author)

@tylerhutcherson

Async support is a hard requirement for LiteLLM. The majority of our users use async; if we merge this and users run it in prod, their LiteLLM service will go down. We have seen this happen before when a non-async function was used.

We're happy to wait on this until redisvl adds async support.

Understood -- thanks for the clarity. Will see what we can do, thanks!

@tylerhutcherson (Author)

FYI: redis/redis-vl-python#214. We are close to finalizing async support there.

tylerhutcherson added a commit to redis/redis-vl-python that referenced this pull request Sep 6, 2024
This PR introduces new async compliant methods to the semantic cache
class using lazy index construction. Because the `AsyncSearchIndex`
requires an async redis python client, we needed to construct that class
lazily upon first usage within the semantic cache class. This PR fixes
some unclosed connection errors and is also in support of
BerriAI/litellm#5412 at LiteLLM.
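The "lazy index construction" pattern described in that commit message can be sketched generically: because an async client must be created inside a running event loop, the wrapping class defers construction until first use, so its own `__init__` stays synchronous. The class and method names below (`AsyncIndex`, `SemanticCacheWrapper`, `acheck`) are illustrative stand-ins, not the actual RedisVL API:

```python
import asyncio

class AsyncIndex:
    """Stand-in for an index/client that must be created inside an event loop."""
    async def search(self, query):
        return f"results for {query!r}"

class SemanticCacheWrapper:
    """Constructs its async index lazily, on first use, so __init__ stays sync."""
    def __init__(self):
        self._index = None
        self._lock = asyncio.Lock()

    async def _get_index(self):
        async with self._lock:          # guard against racing double-construction
            if self._index is None:
                self._index = AsyncIndex()   # now safely inside a running loop
        return self._index

    async def acheck(self, query):
        index = await self._get_index()
        return await index.search(query)

async def main():
    cache = SemanticCacheWrapper()      # safe to construct outside a running loop
    return await cache.acheck("hello")

result = asyncio.run(main())            # -> "results for 'hello'"
```

Deferring construction this way also helps avoid the unclosed-connection errors mentioned above, since the client is only ever created (and used) inside the loop that will drive it.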
@tylerhutcherson (Author)

Now that https://github.com/redis/redis-vl-python 0.3.3 is out with async semantic cache support, I have updated this branch to reflect that. Thanks!

@ishaan-jaff @krrishdholakia

@ishaan-jaff (Contributor) left a comment

Looks good. Could you send a screenshot of your tests passing locally? Happy to merge after that.

litellm/caching.py: 3 review threads (outdated, resolved)

@tylerhutcherson (Author)

Looks good. Could you send a screenshot of your tests passing locally? Happy to merge after that. @tylerhutcherson

Sure thing -- screenshot below. A few caveats:

  • There seems to be an issue with async tests here: the Poetry environment does not include pytest-asyncio (or any async plugin), so pytest skips all async tests and emits warnings like this:
litellm/tests/test_caching.py: 20 warnings
  /Users/tyler.hutcherson/Library/Caches/pypoetry/virtualenvs/litellm-g3ANcoZD-py3.11/lib/python3.11/site-packages/_pytest/python.py:183: PytestUnhandledCoroutineWarning: async def functions are not natively supported and have been skipped.
  You need to install a suitable plugin for your async framework, for example:
    - anyio
    - pytest-asyncio
    - pytest-tornasync
    - pytest-trio
    - pytest-twisted
    warnings.warn(PytestUnhandledCoroutineWarning(msg.format(nodeid)))

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
  • Also, I don't have Azure API access or the diskcache dependency installed, so 3 tests fail (seen below). All of the Redis cases pass using poetry run pytest litellm/tests/test_caching.py.
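For reference, the "async def functions are not natively supported" warning quoted above typically disappears once pytest-asyncio is installed and enabled. A minimal configuration sketch (not taken from the LiteLLM repo) would be:

```ini
; pytest.ini (or the [tool.pytest.ini_options] table in pyproject.toml)
; requires: pip install pytest-asyncio
[pytest]
asyncio_mode = auto
```

With `asyncio_mode = auto`, every `async def test_*` function is collected and awaited automatically; without it, each async test needs an explicit `@pytest.mark.asyncio` marker.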

[Screenshot: local test run, 2024-09-10, 2:20 PM]

@ishaan-jaff

@ishaan-jaff ishaan-jaff changed the base branch from litellm_stable_branch_staging to litellm_stable_pr_merges September 11, 2024 00:23
@ishaan-jaff ishaan-jaff merged commit 2b181a7 into BerriAI:litellm_stable_pr_merges Sep 11, 2024
1 of 2 checks passed