Python: Changed handling of large requests to transfer them as leaked pointers #1655

barshaul · 2024-06-25T17:45:58Z

Large protobuf messages are extremely slow to encode and decode, and with very large ones (e.g., 512MB) the client becomes unresponsive. This PR resolves the issue by passing the arguments as a leaked pointer instead of a bytes vector when the message size exceeds the limit (`MAX_REQUEST_ARGS_LEN`).

Once the bytes/string support is merged, these functions should be changed to use the relevant types. I added TODOs in the code to track this.

avifenesh · 2024-06-26T13:24:25Z

python/python/glide/redis_client.py

+        """
+
+        # TODO: Allow passing different encoding options
+        return bytes(arg, encoding="utf8")


This is not completely platform-agnostic: we need to handle the assumption that strings are UTF-8.
For example, AIX and Solaris are Unix-based operating systems that don't use UTF-8 by default. Newer versions can be configured to use UTF-8, but older versions don't even have the option.
If at some point we want to support Windows, it could be even more problematic.
It might not be relevant enough at this point, but we need more than just allowing a different encoding to be passed. To avoid crashing or producing incorrect results, we need to check which OS we are running on when no specific encoding is passed.

For now we will support only UTF-8 strings and bytes as passed arguments and return values. Later we will support an encoder as a client configuration option, which will allow more encodings. Users who don't use UTF-8 strings can pass bytes instead.
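A minimal sketch of the interim policy described in this thread. The helper name `encode_arg` is hypothetical and is not the function touched by this PR: `str` values are encoded as UTF-8 explicitly, and `bytes` pass through untouched, so a caller with non-UTF-8 data can encode it before handing it to the client.

```python
from typing import Union


def encode_arg(arg: Union[str, bytes]) -> bytes:
    """Hypothetical helper: normalize a command argument to bytes."""
    if isinstance(arg, bytes):
        # Already encoded by the caller; send as-is.
        return arg
    if isinstance(arg, str):
        # Explicit UTF-8, independent of the platform's default encoding.
        return arg.encode("utf-8")
    raise TypeError(f"expected str or bytes, got {type(arg).__name__}")


# A UTF-8 string is encoded by the helper.
utf8_arg = encode_arg("héllo")
# A caller using another encoding passes pre-encoded bytes instead.
latin1_arg = encode_arg("héllo".encode("latin-1"))
```

Because the encoding is named explicitly, the result does not depend on the locale of the machine running the client.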

avifenesh · 2024-06-26T13:28:30Z

python/python/glide/glide.pyi


 from glide.constants import TResult

 DEFAULT_TIMEOUT_IN_MILLISECONDS: int = ...
+MAX_REQUEST_ARGS_LEN: int = ...


Please explain somewhere what this size is, and how a size that comes from Rust can be relevant for all the languages. Something here is a bit confusing.

The variable is defined and documented in socket_listener.rs; I'm only importing it here and exporting it to the Python wrapper.
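To make the role of the constant concrete, here is a rough sketch of how a wrapper might use it to choose a transfer path. The value `4096` is a placeholder for illustration only (the real value lives in socket_listener.rs and is not stated in this thread), and `args_size` and `should_send_as_pointer` are hypothetical names.

```python
from typing import List

# Placeholder value for illustration only; the real constant is defined in
# socket_listener.rs and exported to the Python wrapper.
MAX_REQUEST_ARGS_LEN = 4096


def args_size(args: List[bytes]) -> int:
    """Total number of bytes across all encoded arguments."""
    return sum(len(arg) for arg in args)


def should_send_as_pointer(
    args: List[bytes], limit: int = MAX_REQUEST_ARGS_LEN
) -> bool:
    """True when the arguments are too large to embed in the protobuf message."""
    return args_size(args) > limit


small = [b"SET", b"key", b"value"]
large = [b"SET", b"key", b"x" * (MAX_REQUEST_ARGS_LEN + 1)]
```

Since the core that reads the socket is shared, a single limit defined there can apply to every language wrapper that exports it.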

avifenesh · 2024-06-26T13:34:29Z

It is not clear enough what you are doing here.
When you say "leaked pointers", I think of allocated memory that we lost the pointer to, and it is not clear to me how using it solves the issue.
It might be something simple that others will understand more easily, but at least for me it wasn't, so I would like to see some documentation with a deeper explanation of what exactly is happening here.

barshaul · 2024-06-26T14:24:39Z

We discussed this in the office: it is part of the client's design. Since protobuf doesn't handle large messages properly, we make an FFI call with the args vec, allocate it on the heap on the Rust side, and leak it so that it isn't freed before the core reads it. We then put only the pointer in the protobuf message, send the protobuf message through the socket, and on the core side recover the leaked memory from the received pointer.

Python: Changed handling of large requests to transfer them as leaked…

0f8224b

… pointers

barshaul force-pushed the use_args_pointer_py branch from 9e9eb9f to 0f8224b Compare June 26, 2024 09:12

barshaul marked this pull request as ready for review June 26, 2024 09:17

barshaul requested a review from a team as a code owner June 26, 2024 09:17

barshaul requested a review from avifenesh June 26, 2024 09:18

avifenesh reviewed Jun 26, 2024

avifenesh approved these changes Jun 26, 2024

barshaul merged commit 1453910 into valkey-io:main Jun 26, 2024
6 checks passed

barshaul mentioned this pull request Jul 2, 2024

Python Add large request size support for Scripts #1756

Closed
