
Chatbot with Gradio, FastApi Endpoint, Langchain Integration #1246

Merged
merged 70 commits into from
Jan 26, 2024

Conversation

april-yyt
Collaborator

@april-yyt april-yyt commented Dec 13, 2023

Description of changes:

Use Cases

1. Gradio Interface Integration

  • A new Gradio chat interface has been implemented, allowing interactive conversation with FlexFlow.
  • The interface generates responses to user input through a streamlined, user-friendly web UI.

2. FastAPI Integration

  • The FastAPI application has been updated for better performance and usability.
  • The application can be run using the command uvicorn fastapi_app:app --reload --port PORT_NUMBER, and is accessible at http://localhost:PORT_NUMBER.
  • The server setup includes comprehensive API documentation accessible at http://localhost:PORT_NUMBER/docs.

3. FlexFlow and LangChain Integration

  • FlexFlowLLM Class: This class manages the initialization, configuration, and operation of the FlexFlow server.
  • FF_LLM_wrapper Class: Acts as a wrapper for FlexFlow, facilitating interaction with the LangChain library.
  • Main Execution Flow: The main execution now includes initializing FlexFlow, compiling and starting the server with generation configurations, and implementing a prompt template for response generation using LLMChain.
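The wrapper pattern described above can be sketched in dependency-free form: a LangChain custom LLM exposes a _call method and an _llm_type identifier, which the wrapper forwards to the FlexFlow server object. Both class bodies here are illustrative stand-ins for the PR's real classes, not their actual implementations.

```python
class FlexFlowLLM:
    """Stand-in for the class that initializes, compiles, and runs
    the FlexFlow server (the real one lives in this PR)."""

    def __init__(self, config_file=None):
        self.config_file = config_file

    def generate(self, prompt: str) -> str:
        # A real implementation would call the compiled FlexFlow model.
        return f"[flexflow] {prompt}"


class FF_LLM_wrapper:
    """Minimal wrapper exposing the interface LangChain expects from a
    custom LLM: _call(prompt, stop=None) and an _llm_type string."""

    def __init__(self, flexflow_llm: FlexFlowLLM):
        self.flexflow_llm = flexflow_llm

    @property
    def _llm_type(self) -> str:
        return "flexflow"

    def _call(self, prompt: str, stop=None) -> str:
        if stop is not None:
            raise ValueError("stop kwargs are not supported.")
        return self.flexflow_llm.generate(prompt)


# Main execution flow, mirroring the description above: initialize
# FlexFlow, wrap it, then generate a response through the wrapper.
ff_llm = FlexFlowLLM(config_file=None)
wrapper = FF_LLM_wrapper(flexflow_llm=ff_llm)
answer = wrapper._call("Three tips for staying healthy are: ")
```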

Usage

  1. For Gradio Interface: Run the script with an optional configuration file for custom settings, and interact with the model through the Gradio web interface.
  2. For FastAPI Application: Execute the application with Uvicorn using the command uvicorn fastapi_app:app --reload --port PORT_NUMBER, and use the provided base URL to make API requests or access the documentation.
  3. For FlexFlow and LangChain: Running the script initializes the FlexFlowLLM with an optional configuration file, compiles and starts the server, and uses the FF_LLM_wrapper with LLMChain to generate responses to predefined questions.

Related Issues:

Linked Issues:

  • Issue #

Issues closed by this PR:

  • Closes #


@@ -148,8 +148,6 @@ def main():
         results = llm.generate(prompts)
     else:
         result = llm.generate("Three tips for staying healthy are: ")
-
-    llm.stop_server()
@goliaro goliaro Jan 19, 2024

why do we need to remove this? @april-yyt

@april-yyt replied:
That's an accidental deletion; I will add it back.

@jiazhihao jiazhihao merged commit abf9fb8 into inference Jan 26, 2024
44 checks passed
@goliaro goliaro deleted the chatbot-2 branch February 2, 2024 17:08

Successfully merging this pull request may close these issues.

SpecInfer Fastapi Integration
5 participants