forked from pytorch/FBGEMM
Profile with kineto #79
Closed
Commits (16):
09293c7 Disable c10::optional macros in deeplearning (r-barnes)
b5140f2 Pull in PR for Kleidi-based FP16 kernel (#3507) (andrewjcg)
c02799e Optimzed backward pass for ROCm devices (#3488) (q10)
62f9db7 Fix grid size overflow in generate_vbe_metadata (#3484) (sryap)
0b1739c Support config based bound check version via extended modes (#3454)
804a499 Enable dynamic M grouped gemm (#3444) (jwfromm)
cc1bad1 fix mx4 illegal memory access (#3509)
eaa0961 support quantize_fp8_row for up to 4d non contiguous tensor (#3508)
ca4ea00 MX4 group size configuration for pyper (#3516) (qchip)
7d1c763 deeplearning/fbgemm/fbgemm_gpu/experimental/gen_ai/test/quantize (#3512) (gmagogsfm)
a75d8fe OSS build fixes (#3514) (q10)
6da23d5 Fix index overflow for superlarge inputs (#3519) (jwfromm)
5d6dd92 Async initialization of RockDB SSD tensors (#3520) (mradmila)
12a2246 Explicitly update manylinux version (#3521) (q10)
fe980ab Add new optimizer state `row_counter` for Adam [Backend] (#3342) (spcyppt)
64f8378 profile with kineto to eliminate the CPU overhead in benchmark (amirakb89)
Let's remove dead code