feat: Support nexa model with nexa-sdk #1053
base: master
Conversation
… does not support start_server; does not support other calls in nexa-sdk.
Thanks @LuciusMos, I left some comments below. I think there are currently some issues that may prevent the model from being called as expected; could you do some further testing and add an example? Thanks
camel/models/nexa_model.py (Outdated)

    def __init__(
        self,
        model_type: ModelType,
There's no defined `ModelType` for Nexa.
Should I define a `ModelType` for Nexa? Or should I just use the `str` type for model platforms like Nexa, as with vLLM and Ollama?
We now support both `ModelType` and `str` as the `model_type` input; please update with our latest master branch. I think for Nexa we can just use `str`, since there are many models hosted.
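For illustration, here is a minimal sketch of a string-accepting constructor, mirroring how other OpenAI-compatible backends are typically wired up; the default URL, the `api_key` placeholder, and the keyword names are my assumptions, not the final implementation in this PR.

```python
# Sketch only: assumes an OpenAI-compatible local nexa-sdk server; the
# default URL and keyword names below are illustrative, not final.
from typing import Any, Dict, Optional, Union

from openai import OpenAI

from camel.types import ModelType


class NexaModelSketch:
    def __init__(
        self,
        model_type: Union[ModelType, str],
        model_config_dict: Optional[Dict[str, Any]] = None,
        url: str = "http://localhost:8000/v1",  # assumed local server address
        api_key: str = "not-needed",            # local server ignores the key
    ) -> None:
        self.model_type = model_type
        self.model_config_dict = model_config_dict or {}
        self._client = OpenAI(base_url=url, api_key=api_key)
```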
camel/models/nexa_model.py
Outdated
response = self._client.chat.completions.create( | ||
messages=messages, | ||
# model=self.model_type, | ||
model="model-type", |
I think this shouldn't be hard-coded here.
Same question as above. This would be solved if I simply use the `str` type.
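Continuing the constructor sketch above, a non-hard-coded call could simply forward whatever the user configured; this assumes `self.model_type` is stored as a plain string such as `"llama3.2"`, which is not guaranteed by the current diff.

```python
# Sketch: forward the configured model name instead of a hard-coded literal.
# Assumes self.model_type holds a plain string such as "llama3.2".
def _run(self, messages):
    return self._client.chat.completions.create(
        messages=messages,
        model=str(self.model_type),   # whatever the user configured
        **self.model_config_dict,     # temperature, top_p, max_tokens, ...
    )
```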
camel/configs/nexa_config.py
Outdated
the tokens with top_p probability mass. So :obj:`0.1` means only | ||
the tokens comprising the top 10% probability mass are considered. | ||
(default: :obj:`1.0`) | ||
top_k (int, optional): The number of highest probability vocabulary |
If we are going to use OpenAI compatibility, I think this parameter would not be accepted.
Since there is no evidence that nexa-sdk officially supports OpenAI serving (though it works well in my testing), do you think it is okay for us to delete `top_k` and `nctx` for now, and support them in the future if there are updates in nexa-sdk (and OpenAI)?
From https://github.com/NexaAI/nexa-sdk, they mention that it offers an OpenAI-compatible API server, so yes, we can delete `top_k` and `nctx` for now.
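For reference, this is roughly how that server can be exercised through the standard OpenAI client in a quick local smoke test; the port and model name below come from a local setup and are assumptions, not values fixed by this PR.

```python
# Local smoke test against the nexa-sdk OpenAI-compatible server.
# Port and model name are examples from a local setup.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="llama3.2",  # use whatever model the local server is serving
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```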
camel/configs/nexa_config.py
Outdated
(default: :obj:`False`) | ||
stop (str or list, optional): List of stop words for early stopping | ||
generating further tokens. (default: :obj:`None`) | ||
nctx (int, optional): Length of context window. (default: :obj:`2048`) |
If we are going to use OpenAI compatibility, I think this parameter would not be accepted.
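To make the proposed trimming concrete, here is a sketch of the config with `top_k` and `nctx` removed, keeping only typical fields an OpenAI-compatible endpoint accepts; it is written as a plain dataclass for illustration and does not claim to match camel's config base class or final field set.

```python
# Illustrative only: fields mirror the docstring above with top_k and nctx
# removed; the real class would follow camel's existing config conventions.
from dataclasses import dataclass
from typing import List, Optional, Union


@dataclass
class NexaConfigSketch:
    temperature: float = 1.0
    top_p: float = 1.0
    max_tokens: Optional[int] = None
    stream: bool = False
    stop: Optional[Union[str, List[str]]] = None
```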
Description
Support `v1/chat/completions`.
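As a concrete example of the endpoint this adds support for, a raw request against `v1/chat/completions` might look like the following; the host, port, and model name are assumptions for a locally started nexa-sdk server.

```python
# Example request to the v1/chat/completions endpoint of a local nexa-sdk
# server; URL and model name are assumptions, not defined by this PR.
import requests

resp = requests.post(
    "http://localhost:8000/v1/chat/completions",
    json={
        "model": "llama3.2",
        "messages": [{"role": "user", "content": "Hello!"}],
        "temperature": 0.7,
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```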
Motivation and Context
close #925
Types of changes
What types of changes does your code introduce? Put an `x` in all the boxes that apply.

Implemented Tasks
Checklist
Go over all the following points, and put an `x` in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!