In the example provided at Llama Deploy Python Fullstack, the final output of the workflow is non-streaming: results are only returned once all generated tokens are complete.
As a result, I had to create my own FastAPI service to stream partial results to clients.

Question:
Is there a better way to modify the workflow or llama-deploy so that output is truly streamed to the client as it is produced, without having to write my own FastAPI service?

Objective:
My main goal is to deliver the workflow's results to users as quickly as possible.
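For reference, here is a minimal sketch of the workflow-side streaming I have in mind, assuming a recent llama-index-core where a step can emit incremental events with `ctx.write_event_to_stream()` and the caller consumes them via `handler.stream_events()`. The `TokenEvent`, `StreamingWorkflow`, and `fake_token_source` names are placeholders, and `fake_token_source` stands in for a real LLM `astream_complete()`/`astream_chat()` call:

```python
import asyncio

from llama_index.core.workflow import (
    Context,
    Event,
    StartEvent,
    StopEvent,
    Workflow,
    step,
)


class TokenEvent(Event):
    """Hypothetical event carrying a single generated token."""

    token: str


async def fake_token_source(query: str):
    """Placeholder async generator standing in for an LLM token stream."""
    for tok in ["Hello", ", ", "world", "!"]:
        await asyncio.sleep(0.1)
        yield tok


class StreamingWorkflow(Workflow):
    @step
    async def generate(self, ctx: Context, ev: StartEvent) -> StopEvent:
        # Emit each token as soon as it is produced instead of waiting
        # for the full completion.
        pieces = []
        async for token in fake_token_source(ev.get("query")):
            ctx.write_event_to_stream(TokenEvent(token=token))
            pieces.append(token)
        return StopEvent(result="".join(pieces))


async def main() -> None:
    wf = StreamingWorkflow(timeout=60)
    handler = wf.run(query="hello")
    # Tokens arrive here incrementally, well before the final StopEvent.
    async for ev in handler.stream_events():
        if isinstance(ev, TokenEvent):
            print(ev.token, end="", flush=True)
    final = await handler
    print("\nfinal:", final)


if __name__ == "__main__":
    asyncio.run(main())
```

What I am unsure about is whether llama-deploy can expose this event stream to the frontend directly, or whether the separate FastAPI layer (e.g. a `StreamingResponse` that relays these events) is still required.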