Pull requests: NVIDIA/Fuser
NoOpScheduler::computeHeuristics should return NoOpHeuristic
#3670
opened Jan 3, 2025 by
jjsjann123
Add register sharing to warp-specialized circular buffering
Matmuls
#3669
opened Jan 3, 2025 by
rdspring1
EmbeddingOp node with same functionality as F.embedding
#3649
opened Dec 26, 2024 by
Priya2698
expand removing consecutive cast to handle meta operations in between
#3644
opened Dec 24, 2024 by
jjsjann123
expand RemoveBcastSqueeze to handle unary operations between broadcast/squeeze ops
#3643
opened Dec 24, 2024 by
jjsjann123
Split Hopper MMA by warp-tile before instruction tile
#3642
opened Dec 24, 2024 by
jacobhinkle
Ring Allgather + GEMM Overlap HostIR Implementation
Multi-GPU
#3626
opened Dec 20, 2024 by
nsarka
cacheInputs propagates allocation only for matmul schedulers.
#3621
opened Dec 19, 2024 by
wujingyue
Support outer reduction scheduler with SOL autotuning
Autotune (label: Generate heuristics through machine learning models.)
Lower distributed matmul to pipelined algorithm for fine-grained overlap
Multi-GPU
#3606
opened Dec 18, 2024 by
samnordmann
2 tasks done
[wgmma] Insert commit_group and wait_group after mma_async
Matmuls
#3573
opened Dec 11, 2024 by
jacobhinkle
Draft