split local write instructions. #1180
base: develop
Would this cause LDS bank conflicts?
[----------] Global test environment tear-down
ds_store_b32 operations are issued by 32 threads in the same cycle.
gfx90a passed
========== 76 passed, 32 skipped, 1258 warnings in 8858.51s (2:27:38) ==========
If we split a b128 into 4 x b32, we will get a 4-way bank conflict by default.
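A rough way to check the 4-way figure is to count how many threads land on each bank for one of the split stores. This is only a sketch under stated assumptions: the bank count (32), the bank width (4 bytes), and the layout in which each thread owns a contiguous 16-byte slot are assumptions for illustration and are not taken from this PR. `max_bank_conflict` is a hypothetical helper.

```python
from collections import Counter

NUM_BANKS = 32    # assumed number of LDS banks
BANK_BYTES = 4    # assumed bank width in bytes
NUM_THREADS = 32  # threads issuing in the same cycle, per the comment above


def max_bank_conflict(stride_bytes, sub_store, num_threads=NUM_THREADS):
    """Largest number of threads hitting one bank for a single b32 store.

    stride_bytes: distance between consecutive threads' base addresses.
    sub_store:    which 4-byte piece of the thread's slot is being written.
    """
    hits = Counter()
    for t in range(num_threads):
        addr = t * stride_bytes + sub_store * BANK_BYTES
        hits[(addr // BANK_BYTES) % NUM_BANKS] += 1
    return max(hits.values())


# A b128 store gives each thread a 16-byte slot; split it into four b32 pieces.
print([max_bank_conflict(16, k) for k in range(4)])  # [4, 4, 4, 4]

# For comparison, a native b32 layout (4-byte stride) is conflict-free.
print(max_bank_conflict(4, 0))  # 1
```

With a 16-byte stride, the 32 threads touch only 8 distinct banks per piece, so each bank is hit 4 times. Changing the address layout (for example with padding or swizzling) would change the result.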
ds_store_b128 takes more cycles than one MFMA latency.
Split the b128 into b32 stores and schedule them across different MFMAs if possible.
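The scheduling idea can be illustrated with a small sketch. It is not the code in this PR: `split_and_interleave` and the instruction labels are hypothetical, and the one-piece-per-MFMA policy is an assumption chosen to keep the example short.

```python
def split_and_interleave(mfmas, b128_stores):
    """Split each b128 store into four b32 pieces and place one piece
    after each MFMA; pieces left over once MFMAs run out go at the end."""
    pieces = [f"{store}.b32[{k}]" for store in b128_stores for k in range(4)]
    schedule = []
    for i, mfma in enumerate(mfmas):
        schedule.append(mfma)
        if i < len(pieces):
            schedule.append(pieces[i])
    # "If possible": any pieces that did not fit behind an MFMA are appended.
    schedule.extend(pieces[len(mfmas):])
    return schedule


for instr in split_and_interleave(["mfma0", "mfma1", "mfma2", "mfma3"], ["lw0"]):
    print(instr)
```

Each narrow store then sits behind its own MFMA instead of one wide store spanning more than a single MFMA latency.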