This project demonstrates an architecture for deploying lightweight, optimized large language models (LLMs) in web applications. By leveraging the WebGPU API, it runs inference directly on client-side devices, promoting energy efficiency and sustainability in conversational AI. This demo was created for an ICCRET 2025 paper submission.
- Client-Side Inference: Uses the WebGPU API to run models directly in the browser (see the sketch after this list).
- Optimized Model: Uses the quantized Qwen2.5-0.5B-Instruct-q4f16_1-MLC model.
- User-Friendly Interface: Built with Svelte.js for a responsive and intuitive user experience.
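As a minimal sketch of what client-side inference with WebLLM looks like (not this project's exact code), the engine is created from the model ID and queried through an OpenAI-style chat API; the system prompt below is purely illustrative:

```typescript
import { CreateMLCEngine } from "@mlc-ai/web-llm";

// Downloads and compiles the quantized model to WebGPU on first visit;
// the progress callback lets the UI show a loading indicator.
const engine = await CreateMLCEngine("Qwen2.5-0.5B-Instruct-q4f16_1-MLC", {
  initProgressCallback: (report) => console.log(report.text),
});

// Inference runs entirely in the browser, with no server round-trip.
const reply = await engine.chat.completions.create({
  messages: [
    { role: "system", content: "You are a helpful customer support assistant." },
    { role: "user", content: "How do I reset my password?" },
  ],
});

console.log(reply.choices[0].message.content);
```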
To get started with this project, clone the repository and install the necessary dependencies:
```bash
git clone https://github.com/SouZe-San/webllm-bot.git
cd webllm-bot
bun install
```
Run the application locally:
```bash
bun run dev
```
Open your browser and navigate to http://localhost:5173 to access the demo customer support site and the chatbot.
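Because inference runs on the GPU inside the browser, a WebGPU-capable browser (e.g., a recent version of Chrome or Edge) is required. A minimal feature check, assuming only the standard `navigator.gpu` interface (and WebGPU type definitions such as `@webgpu/types` in a TypeScript project), might look like:

```typescript
// Detect WebGPU support before attempting to load the model, so the
// chatbot can fall back to a friendly message on unsupported browsers.
async function hasWebGPU(): Promise<boolean> {
  if (!("gpu" in navigator)) return false;
  // requestAdapter() resolves to null when no suitable GPU is available.
  const adapter = await navigator.gpu.requestAdapter();
  return adapter !== null;
}

if (!(await hasWebGPU())) {
  console.warn("WebGPU is not available; the chatbot cannot run in this browser.");
}
```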
We would like to thank the team behind the WebLLM JavaScript library, which made it straightforward to integrate pre-compiled models and load them onto WebGPU.