petals models local support using python env #435
Conversation
…ersal-apple-darwin
Co-authored-by: Casper da Costa-Luis <casper.dcl@physics.org>
just merged #440; lemme know if you need any help rebasing @biswaroop1547 :)
Using the latest commit I downloaded Stable Beluga; once I click Open it hangs and seems to never really start:

src/controller_binaries.rs:97 2023-11-03T00:50:41 [INFO] - serve_command: setup-petals.sh --model-id petals-team/StableBeluga2 --model-path . --dht-prefix StableBeluga2-hf --port 8734
src/controller_binaries.rs:102 2023-11-03T00:50:41 [INFO] - binary_path: "/Users/tiero/Library/Application Support/io.premai.prem-app/models/stable-beluga-2/setup-petals.sh"
src/controller_binaries.rs:122 2023-11-03T00:50:41 [INFO] - args: ["--model-id", "petals-team/StableBeluga2", "--model-path", ".", "--dht-prefix", "StableBeluga2-hf", "--port", "8734"]
src/controller_binaries.rs:166 2023-11-03T00:50:42 [ERROR] - Failed to send request: error sending request for url (http://localhost:8734/v1): error trying to connect: tcp connect error: Connection refused (os error 61)
src/controller_binaries.rs:166 2023-11-03T00:50:43 [ERROR] - Failed to send request: error sending request for url (http://localhost:8734/v1): error trying to connect: tcp connect error: Connection refused (os error 61)
src/controller_binaries.rs:166 2023-11-03T00:50:43 [ERROR] - Failed to send request: error sending request for url (http://localhost:8734/v1): error trying to connect: tcp connect error: Connection refused (os error 61)
src/controller_binaries.rs:166 2023-11-03T00:50:44 [ERROR] - Failed to send request: error sending request for url (http://localhost:8734/v1): error trying to connect: tcp connect error: Connection refused (os error 61)
[TRUNCATED]
@tiero it actually takes around ~30 sec for the model server to start up. To check whether the server is up after that duration, you can also do
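A minimal sketch of such a liveness check (the port 8734 and the `/v1` path are taken from the logs above; `wait_for_server` is a hypothetical helper, not part of prem-app):

```python
import time
import urllib.error
import urllib.request


def wait_for_server(url: str, timeout_s: float = 60.0) -> bool:
    """Poll `url` once per second until it answers or `timeout_s` elapses."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            # Any HTTP response means the socket is accepting connections.
            urllib.request.urlopen(url, timeout=2)
            return True
        except urllib.error.HTTPError:
            return True  # server answered, just not with a 2xx status
        except (urllib.error.URLError, OSError):
            time.sleep(1)  # still starting up; retry
    return False


# e.g. wait_for_server("http://localhost:8734/v1", timeout_s=60)
```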
We should have a timeout; I waited more than 5 minutes and it kept hanging for me. What can I do to debug?
@tiero that's weird, because it shouldn't take more than 30 secs (given you've run the swarm before, since that creates the python env which'll be reused for petals). If you're starting anew, it'd take around 3 mins, as it also installs and sets up the python environment before starting the server (currently, while the python env is being set up after you click "open", we don't show any message). To debug, can you remove
btw I guess
Yes, correct @casperdcl. It's just for testing. The service actually works, but it takes a huge amount of time: after the health request succeeded, it took 60 seconds to load the chat screen.
@filopedraz yeah, it takes longer if it's creating the env from scratch, but if the env is already present it takes between 30 secs and 1 min. Do we want to show some kind of message while this is happening? (It's mentioned as one of the issues/todos in this PR desc.) This is the actual time it takes to load the model into memory after starting the server; up for ideas here on what we can do to reduce this time though 🙏🏻
Good for now. I am more worried about the time between the toast and the load of the chat; I don't know what causes it. It happens to me with Mistral too, actually.
Download doesn't even start now. Here's a Loom.
The PR looks good and it works well for me. I created a new issue here for the generation concerns.
I suggest we squash-merge because it's ultimately quite small & not worth rebasing/preserving history |
Before merge, a few issues need to be taken care of:

- Manage the two execution variants of Stable Beluga (docker & local python env) through different `id` and other fields in prem-registry. We can remove the docker variant (manifest file for stablebeluga2 of type:process using petals registry#88).
- Stable Beluga currently concatenates the user prompt into generated responses; make sure it only shows newly generated tokens (cht-petals: minor edits on paths and default parameters premAI-io/prem-services#125).
- Add a prompt template for Stable Beluga (llama based? or orca based? need to check) (cht-petals: minor edits on paths and default parameters premAI-io/prem-services#125).
- [NOT CRITICAL] In the UI, generated responses from Stable Beluga show up as `strikethrough`.
- Takes a bit of time when starting for the first time (env creation also happens if swarm mode wasn't run before). Need to show some message here? (Not an issue currently.)
- [NOT CRITICAL] Generation with `max_new_tokens` greater than around 5 sometimes fails, since generation takes a longer and variable time and it looks like a timeout occurs from prem-app's side (Streaming support for Petals services premAI-io/prem-services#127):
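On the prompt-template question above: the StableBeluga2 model card documents an Orca-style format, which could be sketched as below (to be verified against the model card before wiring anything into cht-petals; `build_prompt` is a hypothetical helper):

```python
def build_prompt(system_prompt: str, user_message: str) -> str:
    # Orca-style template as shown on the StableBeluga2 model card;
    # double-check the exact headers and spacing before relying on this.
    return f"### System:\n{system_prompt}\n\n### User:\n{user_message}\n\n### Assistant:\n"


# Example:
# build_prompt("You are a helpful assistant.", "Hello!")
```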