Langchain-Chatchat with BigDL-LLM Acceleration on Intel GPUs

Langchain-Chatchat is a RAG (Retrieval Augmented Generation) application that implements knowledge and search engine based QA. This repo is a fork of chatchat-space/Langchain-Chatchat, and includes BigDL-LLM optimizations to run it on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max).

You can change the UI language in the left-side menu. We currently support English and 简体中文 (see video demos below).

English	简体中文
Langchain-chatchat-en.mp4	Langchain-chatchat-chs.mp4

The following sections introduce how to install and run Langchain-chatchat on Intel Core Ultra platform (MTL), utilizing the iGPU to run both LLMs and embedding models.

Langchain-Chatchat Architecture

See the RAG pipeline in the Langchain-Chatchat architecture below (source).

Installation

Download Langchain-Chatchat

Download the Langchain-Chatchat with BigDL-LLM integrations from this link. Unzip the content into a directory, e.g.,C:\Users\arda\Downloads\Langchain-Chatchat-bigdl-llm.

Install Prerequisites

Visit the Install BigDL-LLM on Windows with Intel GPU Guide, and follow Install Prerequisites to install Visual Studio, GPU driver, oneAPI, and Conda.

Install Python Dependencies

1. Create a Conda Environment

Open Anaconda Prompt (miniconda3), and run the following commands to create a new python environment:

conda create -n bigdl-langchain-chatchat python=3.11 libuv 
conda activate bigdl-langchain-chatchat

Note: When creating the conda environment we used python 3.11, which is different from the default recommended python version 3.9 in Install BigDL-LLM on Windows with Intel GPU

2. Install `bigdl-llm`

pip install --pre --upgrade bigdl-llm[xpu] -f https://developer.intel.com/ipex-whl-stable-xpu
pip install --pre --upgrade torchaudio==2.1.0a0  -f https://developer.intel.com/ipex-whl-stable-xpu

3. Install Langchain-Chatchat Dependencies

Switch to the root directory of Langchain-Chatchat you've downloaded (refer to the download section), and install the dependencies with the commands below. Note: In the example commands we assume the root directory is C:\Users\arda\Downloads\Langchain-Chatchat-bigdl-llm. Remember to change it to your own path).

cd C:\Users\arda\Downloads\Langchain-Chatchat-bigdl-llm
pip install -r requirements_bigdl.txt 
pip install -r requirements_api_bigdl.txt
pip install -r requirements_webui.txt

Configuration

In root directory of Langchain-Chatchat, run the following command to create a config:
```
python copy_config_example.py
```
Edit the file configs\model_config.py, change MODEL_ROOT_PATH to the absolute path where you put the downloaded models (LLMs, embedding models, ranking models, etc.)

Download Models

Download the models and place them in the path MODEL_ROOT_PATH (refer to details in Configuration section).

Currently, we support only the LLM/embedding models specified in the table below. You can download these models using the link provided in the table. Note: Ensure the model folder name matches the last segment of the model ID following "/", for example, for THUDM/chatglm3-6b, the model folder name should be chatglm3-6b.

Model	Category	download link
`THUDM/chatglm3-6b`	Chinese LLM	HF or ModelScope
`meta-llama/Llama-2-7b-chat-hf`	English LLM	HF
`BAAI/bge-large-zh-v1.5`	Chinese Embedding	HF
`BAAI/bge-large-en-v1.5`	English Embedding	HF

One-time Warm-up

When you run this applcation on Intel GPU for the first time, it is highly recommended to do a one-time warmup (for GPU kernels compilation).

In Anaconda Prompt (miniconda3), under the root directory of Langchain-Chatchat, with conda environment activated, run the following commands:

python warmup.py

Note: The warmup may take several minutes. You just have to run it one-time on after installation.

Start the Service

Open Anaconda Prompt (miniconda3) and run the following commands:

conda activate bigdl-langchain-chatchat
call "C:\Program Files (x86)\Intel\oneAPI\setvars.bat"
set SYCL_CACHE_PERSISTENT=1
set BIGDL_LLM_XMX_DISABLED=1
set no_proxy=localhost,127.0.0.1
python startup.py -a

You can find the Web UI's URL printted on the terminal logs, e.g. http://localhost:8501/.

Open a browser and navigate to the URL to use the Web UI.

Usage

To start chatting with LLMs, simply type your messages in the textbox at the bottom of the UI.

How to use RAG

Step 1: Create Knowledge Base

Select Manage Knowledge Base from the menu on the left, then choose New Knowledge Base from the dropdown menu on the right side.
Fill in the name of your new knowledge base (example: "test") and press the Create button. Adjust any other settings as needed.
Upload knowledge files from your computer and allow some time for the upload to complete. Once finished, click on Add files to Knowledge Base button to build the vector store. Note: this process may take several minutes.

Step 2: Chat with RAG

You can now click Dialogue on the left-side menu to return to the chat UI. Then in Knowledge base settings menu, choose the Knowledge Base you just created, e.g, "test". Now you can start chatting.

For more information about how to use Langchain-Chatchat, refer to Official Quickstart guide in English, Chinese, or the Wiki.

Trouble Shooting & Tips

1. Version Compatibility

Ensure that you have installed versions of bigdl-llm and bigdl-core-xe-21 that are greater than 2.5.0b20240321. You can run pip list | grep bigdl to check the verisons. To upgrade, follow the installation instructions above.

2. Prompt Templates

In the left-side menu, you have the option to choose a prompt template. There're several pre-defined templates - those ending with '_cn' are Chinese templates, and those ending with '_en' are English templates. You can also define your own prompt templates in configs/prompt_config.py. Remember to restart the service to enable these changes.

Name		Name	Last commit message	Last commit date
Latest commit History 1,878 Commits
.github		.github
chains		chains
common		common
configs		configs
docs		docs
document_loaders		document_loaders
embeddings		embeddings
img		img
nltk_data		nltk_data
server		server
tests		tests
text_splitter		text_splitter
webui_pages		webui_pages
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
README_chs.md		README_chs.md
README_en.md		README_en.md
README_ja.md		README_ja.md
copy_config_example.py		copy_config_example.py
init_database.py		init_database.py
release.py		release.py
requirements.txt		requirements.txt
requirements_api.txt		requirements_api.txt
requirements_api_bigdl.txt		requirements_api_bigdl.txt
requirements_bigdl.txt		requirements_bigdl.txt
requirements_lite.txt		requirements_lite.txt
requirements_webui.txt		requirements_webui.txt
shutdown_all.sh		shutdown_all.sh
startup.py		startup.py
warmup.py		warmup.py
webui.py		webui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Langchain-Chatchat with BigDL-LLM Acceleration on Intel GPUs

Table of Contents

Langchain-Chatchat Architecture

Installation

Download Langchain-Chatchat

Install Prerequisites

Install Python Dependencies

1. Create a Conda Environment

2. Install `bigdl-llm`

3. Install Langchain-Chatchat Dependencies

Configuration

Download Models

One-time Warm-up

Start the Service

Usage

How to use RAG

Step 1: Create Knowledge Base

Step 2: Chat with RAG

Trouble Shooting & Tips

1. Version Compatibility

2. Prompt Templates

About

Releases

Packages

Languages

License

Oscilloscope98/Langchain-Chatchat

Folders and files

Latest commit

History

Repository files navigation

Langchain-Chatchat with BigDL-LLM Acceleration on Intel GPUs

Table of Contents

Langchain-Chatchat Architecture

Installation

Download Langchain-Chatchat

Install Prerequisites

Install Python Dependencies

1. Create a Conda Environment

2. Install bigdl-llm

3. Install Langchain-Chatchat Dependencies

Configuration

Download Models

One-time Warm-up

Start the Service

Usage

How to use RAG

Step 1: Create Knowledge Base

Step 2: Chat with RAG

Trouble Shooting & Tips

1. Version Compatibility

2. Prompt Templates

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

2. Install `bigdl-llm`

Packages