Pinned Loading
-
facebookresearch/LayerSkip
facebookresearch/LayerSkip PublicCode for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
-
facebookresearch/MODel_opt
facebookresearch/MODel_opt Public archiveMemory Optimizations for Deep Learning (ICML 2023)
-
pytorch-labs/superblock
pytorch-labs/superblock Public archiveA block oriented training approach for inference time optimization.
-
-
selkerdawy/FTWT
selkerdawy/FTWT PublicFire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.