Actions: huggingface/optimum-neuron

Build documentation

263 workflow runs

Address several NeuronModelForCausalLM and TGI fixes issues (#454)
Build documentation #217: Commit eb2a93f pushed by dacorvo
January 31, 2024 16:48 2m 34s main
AWS Neuron SDK 2.16.1: update neuronxcc (#449)
Build documentation #216: Commit c345de4 pushed by dacorvo
January 31, 2024 08:57 2m 31s main
[Pre Neuron Inf Cache system]Support neff/weights decoupling (#402)
Build documentation #215: Commit de5752d pushed by JingyaHuang
January 30, 2024 16:39 2m 28s main
TGI: export model if configuration is cached (#445)
Build documentation #214: Commit c114fc8 pushed by dacorvo
January 30, 2024 16:03 3m 20s main
Add neuronx cache registry (#442)
Build documentation #213: Commit 0f7bf4a pushed by dacorvo
January 26, 2024 15:43 2m 30s main
Allow exporting decoder models using optimum-cli (#422)
Build documentation #212: Commit 7d0dbb5 pushed by dacorvo
January 25, 2024 14:39 2m 37s main
Add Llama2 inference benchmark under a new "benchmarks" section (#435)
Build documentation #211: Commit 2709183 pushed by dacorvo
January 24, 2024 15:17 2m 46s main
[Documentation] Add Sentence Transformers Guide and Notebook (#434)
Build documentation #210: Commit ebfa141 pushed by philschmid
January 23, 2024 17:55 2m 27s main
Initial support for Pipeline Parallelism (#279)
Build documentation #209: Commit ca6c4ff pushed by michaelbenayoun
January 23, 2024 16:05 2m 33s main
Docs nits (#428)
Build documentation #208: Commit b643d7f pushed by michaelbenayoun
January 23, 2024 14:57 2m 26s main
API change to be compatible to Optimum (#421)
Build documentation #207: Commit ff293f0 pushed by JingyaHuang
January 22, 2024 11:06 2m 32s main
Minor doc fix (#432)
Build documentation #206: Commit a380d2d pushed by JingyaHuang
January 21, 2024 12:00 2m 21s main
Improve doc and notebooks push to hub (#429)
Build documentation #205: Commit 61143e2 pushed by JingyaHuang
January 19, 2024 12:48 2m 35s main
chore: bump dev version (#427)
Build documentation #204: Commit 2ca9c74 pushed by dacorvo
January 19, 2024 09:44 2m 28s main
release: v0.0.17
Build documentation #203: Commit 8d4b6dc pushed by dacorvo
January 19, 2024 07:19 3m 28s v0.0.17
Do not upload NeuronModelForCausalLM weights when they can be reconst…
Build documentation #202: Commit c60935b pushed by dacorvo
January 18, 2024 10:49 2m 58s main
Add Neuronx compile cache proxy and use it for LLM decoder models (#410)
Build documentation #201: Commit f81c365 pushed by dacorvo
January 17, 2024 09:39 2m 24s main
Fix typo for NeuronSentenceTransformers class (#412)
Build documentation #200: Commit 66c42d7 pushed by dacorvo
January 17, 2024 07:54 2m 25s main
Add general support for generation on TRN with NxD (#370)
Build documentation #199: Commit 8fd86c1 pushed by dacorvo
January 17, 2024 07:53 2m 24s main
[Inference] Improve the support of sentence transformers (#408)
Build documentation #198: Commit 9837efa pushed by JingyaHuang
January 16, 2024 21:24 2m 27s main
Add support for Mistral models (#411)
Build documentation #197: Commit 43d2f90 pushed by dacorvo
January 16, 2024 07:29 2m 30s main
Skip pushing if the user does not have write access to the cache repo…
Build documentation #196: Commit 104bd64 pushed by michaelbenayoun
January 15, 2024 16:17 2m 26s main
Bump hf libraries versions (#403)
Build documentation #195: Commit 923398e pushed by JingyaHuang
January 11, 2024 11:38 2m 33s main
[documentation] Add Llama 7B Guide (#401)
Build documentation #194: Commit 05e6822 pushed by philschmid
January 10, 2024 13:43 2m 27s main
Use AWS Neuron SDK 2.16 packages (#398)
Build documentation #193: Commit 3b3afa4 pushed by dacorvo
January 10, 2024 09:34 2m 35s main