-
Notifications
You must be signed in to change notification settings - Fork 1k
Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: enable pytorch xpu support for non-attention models
#2561
opened Sep 24, 2024 by
dvrogozh
Loading…
Add support for GPTQ-quantized MoE models using MoE Marlin
#2557
opened Sep 24, 2024 by
danieldk
Loading…
5 tasks
do not set sliding_window if SUPPORTS_WINDOWING is false
#2554
opened Sep 24, 2024 by
sywangyi
Loading…
5 tasks
CI for add gptq and awq int4 support in intel platform
#2494
opened Sep 5, 2024 by
ErikKaum
Loading…
fix: skip cuda graphs that will oom and improve free memory logging
#2450
opened Aug 22, 2024 by
drbh
Loading…
add gptq and awq int4 support in intel platform
#2444
opened Aug 22, 2024 by
sywangyi
Loading…
5 tasks
[TENSORRT-LLM] - Implement new looper thread based backend
#2357
opened Aug 2, 2024 by
mfuntowicz
•
Draft
ProTip!
Adding no:label will show everything without a label.