Actions: huggingface/text-generation-inference

CI build

1,670 workflow runs

Fix: Change model_type from ssm to mamba
CI build #1751: Pull request #2740 opened by mokeddembillel
November 10, 2024 22:49 Action required mokeddembillel:fix/ssm-to-mamba
Fix: Change embeddings to embedding
CI build #1750: Pull request #2738 opened by mokeddembillel
November 10, 2024 22:05 Action required mokeddembillel:fix/mamba-model
Add initial support for compressed-tensors checkpoints (#2732)
CI build #1749: Commit a785000 pushed by danieldk
November 10, 2024 12:54 1d 13h 30m 49s main
Add llama.cpp backend
CI build #1748: Pull request #2723 synchronize by mfuntowicz
November 9, 2024 21:19 1d 5h 5m 16s feat-backend-llamacpp
Add llama.cpp backend
CI build #1747: Pull request #2723 synchronize by mfuntowicz
November 9, 2024 21:10 9m 23s feat-backend-llamacpp
Support continue final message
CI build #1746: Pull request #2733 synchronize by drbh
November 8, 2024 20:47 1d 5h 37m 54s support-continue-final-message
Support continue final message
CI build #1745: Pull request #2733 opened by drbh
November 8, 2024 19:04 1h 53m 5s support-continue-final-message
Add initial support for compressed-tensors checkpoints
CI build #1744: Pull request #2732 synchronize by danieldk
November 8, 2024 12:23 1d 14h 1m 2s feature/compressed-tensors
Add initial support for compressed-tensors checkpoints
CI build #1743: Pull request #2732 synchronize by danieldk
November 8, 2024 10:34 1h 50m 4s feature/compressed-tensors
Add initial support for compressed-tensors checkpoints
CI build #1742: Pull request #2732 synchronize by danieldk
November 8, 2024 10:25 16m 4s feature/compressed-tensors
Add initial support for compressed-tensors checkpoints
CI build #1741: Pull request #2732 synchronize by danieldk
November 8, 2024 08:40 1h 45m 16s feature/compressed-tensors
add ipex moe implementation to support Mixtral and PhiMoe
CI build #1740: Pull request #2707 synchronize by sywangyi
November 8, 2024 06:10 Action required sywangyi:moe
Add initial support for compressed-tensors checkpoints
CI build #1739: Pull request #2732 synchronize by danieldk
November 7, 2024 14:46 1d 2h 29m 34s feature/compressed-tensors
Add initial support for compressed-tensors checkpoints
CI build #1738: Pull request #2732 synchronize by danieldk
November 7, 2024 13:54 1h 1m 36s feature/compressed-tensors
add trust_remote_code in tokenizer to fix baichuan issue (#2725)
CI build #1737: Commit 97f7a22 pushed by Narsil
November 7, 2024 13:43 1d 12h 41m 20s main
Add initial support for compressed-tensors checkpoints
CI build #1736: Pull request #2732 opened by danieldk
November 7, 2024 13:12 54m 18s feature/compressed-tensors
Add llama.cpp backend
CI build #1735: Pull request #2723 synchronize by mfuntowicz
November 5, 2024 22:48 1d 3h 36m 27s feat-backend-llamacpp
feat: add payload limit
CI build #1734: Pull request #2726 opened by OlivierDehaene
November 5, 2024 15:39 1d 10h 45m 46s feat/limit
Upgrade outlines v2
CI build #1732: Pull request #2724 synchronize by aW3st
November 5, 2024 04:36 Action required aW3st:upgrade-outlines-v2
add ipex moe implementation to support Mixtral and PhiMoe
CI build #1731: Pull request #2707 synchronize by sywangyi
November 5, 2024 02:07 Action required sywangyi:moe
Upgrade outlines v2
CI build #1730: Pull request #2724 opened by aW3st
November 4, 2024 23:55 Action required aW3st:upgrade-outlines-v2
Add llama.cpp backend
CI build #1729: Pull request #2723 opened by mfuntowicz
November 4, 2024 22:25 1d 0h 31m 5s feat-backend-llamacpp
feat: support flash attention 2 in qwen2 vl vision blocks
CI build #1728: Pull request #2721 opened by drbh
November 4, 2024 16:29 1d 9h 55m 8s support-flash-qwen2-vl
fix incorrect output of Qwen2-7B-Instruct-GPTQ-Int4 and Qwen2-7B-Inst…
CI build #1727: Commit b1f9044 pushed by danieldk
November 4, 2024 15:07 1d 11h 17m 12s main
Update to moe-kernels 0.7.0
CI build #1726: Pull request #2720 opened by danieldk
November 4, 2024 15:05 1d 11h 19m 44s maintenance/marlin-kernels-0.7.0