-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alabulei1 add llama2 inference #182
Alabulei1 add llama2 inference #182
Conversation
Hello, I am a code review bot on flows.network. Here are my reviews of code commits in this PR. Overall, the pull request titled "Alabulei1 add llama2 inference" seems to be focused on adding support and documentation for running llama2 inference using the ggml backend in WasmEdge and Rust. The changes include adding a new section for the WASI-NN plug-in with ggml backend, providing installation instructions for WasmEdge with the ggml backend, and adding a link to the chapter on running AI inference with llama2 models. Most of the individual summaries did not identify any potential problems or issues with the patch. However, there are a few findings that should be addressed:
In summary, the pull request adds valuable documentation and functionality for running llama2 inference using the ggml backend. It is important to address the identified issues and errors to ensure the overall quality of the patch. DetailsCommit ce0d25ca6ea021db50702ac0a78fe710cfeda44bKey changes:
Potential problems:
Overall, the changes seem to be straightforward and focused on adding support for llama2 inference using the ggml backend. Commit 7e32ab6e6b8d7d597ae25bb0086d712a9fc64a1fKey changes in the pull request:
Potential problems:
Additional comments:
Commit 7d95466766594169c7e6deb131e2efb23ef8dc00Key changes:
Potential problems:
Overall, the changes seem to be small and mostly cosmetic. There are no major issues with the patch. Commit ccc52f29a298fceae0583183e5e5a9f2be4533c6Key changes in the patch:
Potential problems:
Commit b05a011aaa958338c362455be61a215840400505Key changes:
Potential problems:
Commit 5c047dc9d6df5c0ac4c5bfc21f17c0646b24a1e0Key changes:
Potential problems:
Overall, the patch seems to add documentation for running llama2 inferences using WasmEdge and Rust. It would be important to review the Commit 106c7505e592c17b7ff728549d42e91f12110bf1Key changes in the patch:
Potential problems:
Overall, the changes in this patch seem relatively minor and do not raise any major concerns. Commit e54976134fb741332b37a43e12139b00c261b972Key changes in the patch:
Potential problems:
Commit 6d118bcb271db98433267ded3272303a33d60a1fKey changes in the patch:
Potential problems:
Commit 1461be0f00429f76af5529e7506fa1ea3a500a58Key changes:
Potential problems:
Commit 4c5cd77c01e519d2f4db3d79f1bb391b48cd3898Key changes:
Potential problems:
Commit 112214db0d07090bc177877348fb116e4175efa7Key Changes:
Potential Problems:
|
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
1 We should download models from our HF repo now: https://huggingface.co/wasmedge/llama2 2 I think it is easier to walkthrough the code for the No need for code walkthrough for the latter two examples. |
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Hi @juntao The How about this one? https://github.com/second-state/WasmEdge-WASINN-examples/blob/master/wasmedge-ggml-llama-interactive/src/main.rs Will two different repos confuse readers? |
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
The simple example should not have chat template. It should not support conversation history either. It should just let the user manually assemble a promote and it will output a string and exit. We will need to simplify it. The other example will be removed later. There cannot be two sources of truth. |
@apepkuss Could you please check out the comments above -- simplify the |
Will update |
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Signed-off-by: alabulei1 <vivian.xiage@gmail.com>
Add docs for