-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[GPU] Optimize graph transformation for pytorch (#26410)
### Details: - Transpose fusion into MatMul may have caused perf drop even if tensor is aligned by 16 - For small tensors & aligned-by-16, fuse Transpose into MatMul. - For large tensors, do not fuse Transpose - Remove Pad in front of MaxPool. - MaxPool adds padding for CEIL_PYTORCH rounding type. - The pad should be removed if the pad_begin and pad_end are 0. Otherwise, it would cause perf drop. ### Tickets: - *150556*
- Loading branch information
Showing
3 changed files
with
92 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters