streaming server support? #20

kevintanhongann · 2024-02-23T12:25:10Z

Is there a way to run and expose an API streaming server compatible with OpenAI API specifications?

tjake · 2024-02-23T13:41:20Z

Probably. Here's the current API call for chat:

https://github.com/tjake/Jlama/blob/main/jlama-cli/src/main/java/com/github/tjake/jlama/cli/serve/GenerateResource.java

phact · 2024-02-23T14:14:03Z

I want this feature too

geoand · 2024-03-01T08:51:22Z

I am pretty sure that this would (at least) require Generator#generate to be enhanced with a callback that is called when the generation is complete.

phact · 2024-03-07T15:34:40Z

You mean for stream=false?

geoand · 2024-03-07T16:10:50Z

For both :)
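To make the callback idea above concrete, here is a minimal sketch of how one generate call with a per-token callback and a completion callback could serve both `stream=true` and `stream=false`. Everything in it is an assumption for illustration: `SketchGenerator`, `EchoGenerator`, `streamed`, and `blocking` are hypothetical names, not Jlama's actual `Generator` API, and the events are simplified plain-text `data:` lines rather than the JSON chunks an OpenAI-compatible server would send.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

public class CallbackSketch {

    /** Hypothetical stand-in for a generator; NOT Jlama's real Generator interface. */
    interface SketchGenerator {
        void generate(String prompt, Consumer<String> onToken, Consumer<String> onComplete);
    }

    /** Toy generator that "generates" by echoing the prompt back one word at a time. */
    static class EchoGenerator implements SketchGenerator {
        @Override
        public void generate(String prompt, Consumer<String> onToken, Consumer<String> onComplete) {
            StringBuilder full = new StringBuilder();
            for (String word : prompt.split(" ")) {
                String token = word + " ";
                full.append(token);
                onToken.accept(token);                  // fired once per generated token
            }
            onComplete.accept(full.toString().trim());  // fired once, when generation is done
        }
    }

    /** stream=true: emit one simplified SSE-style line per token, then a terminator. */
    static List<String> streamed(SketchGenerator generator, String prompt) {
        List<String> events = new ArrayList<>();
        generator.generate(
                prompt,
                token -> events.add("data: " + token.trim()),
                full -> events.add("data: [DONE]"));
        return events;
    }

    /** stream=false: ignore individual tokens and return the full text on completion. */
    static String blocking(SketchGenerator generator, String prompt) {
        String[] result = new String[1];
        generator.generate(prompt, token -> { }, full -> result[0] = full);
        return result[0];
    }

    public static void main(String[] args) {
        SketchGenerator generator = new EchoGenerator();
        System.out.println(streamed(generator, "hello streaming world"));
        System.out.println(blocking(generator, "hello streaming world"));
    }
}
```

In this sketch the completion callback is what lets the streaming path know when to send its terminator and the non-streaming path know when the full response is ready, which is one reading of why it would be needed for both modes.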

phact · 2024-03-07T21:38:33Z

Working PR here: #23

tjake linked a pull request Aug 9, 2024 that will close this issue

Add openai-api built from openapi spec #43

Merged

tjake closed this as completed in #43 Aug 11, 2024
