[TKW] Teach expansion to handle non direct acc and ReduceOp on reduction dim. #243
Conversation
Overall, looks good! Thanks, I like the changes to expansion. Just some comments about documentation.
Force-pushed from d9e0b24 to c7f5ac3. (Commits signed off by: Stanley Winata <stanley.winata@amd.com>)
Fix the equality check: a manual comparison of shape and type is required, since the nodes sometimes do not compare equal without it. Signed-off-by: Stanley Winata <stanley.winata@amd.com>
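As a rough illustration of the kind of check this commit describes (hypothetical helper and attribute names, not the actual expansion-pass code):

```python
def nodes_equivalent(a, b) -> bool:
    # Operator identity alone is not enough to treat two expanded
    # nodes as interchangeable; their shapes and element types must
    # also match.
    return a.op == b.op and a.shape == b.shape and a.dtype == b.dtype
```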
Overall looks good, and we can address the open questions later. Thanks!
In flash attention, we need to enable non-direct-acc matmul, as well as expansion of ReduceOp along the reduction dimension. The former is needed in FA because we apply scaling to the acc of the second MMA before feeding it in, so the accumulator is no longer the direct output of another MMA. The latter is required because ReduceOp/MaxOp is in the backward slice of the second MMA's LHS, which means it must be expanded in the K2/reduction dimension as well.
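For context, here is a minimal schematic of the flash-attention online-softmax loop in plain NumPy (illustrative pseudocode only, not the TKW DSL; all names are invented for this sketch). It shows both patterns: the accumulator is rescaled before the second matmul (the non-direct-acc case), and the row-max reduction feeds the LHS of that second matmul (the ReduceOp-on-reduction-dim case):

```python
import numpy as np

def flash_attention_tile(q, k_tiles, v_tiles):
    """Online-softmax loop over K/V tiles; purely illustrative."""
    m = np.full(q.shape[0], -np.inf)          # running row max
    l = np.zeros(q.shape[0])                  # running row sum
    acc = np.zeros((q.shape[0], v_tiles[0].shape[1]))
    for k, v in zip(k_tiles, v_tiles):
        s = q @ k.T                           # first MMA
        m_new = np.maximum(m, s.max(axis=1))  # max ReduceOp in the backward
                                              #   slice of the second MMA's LHS
        p = np.exp(s - m_new[:, None])        # second MMA's LHS
        scale = np.exp(m - m_new)
        l = l * scale + p.sum(axis=1)
        acc = acc * scale[:, None]            # acc is scaled before use:
        acc = p @ v + acc                     #   non-direct-acc matmul
        m = m_new
    return acc / l[:, None]
```

The `acc * scale` multiply is exactly why expansion can no longer assume the second MMA's accumulator comes directly from another MMA node, and the row max is what forces ReduceOp to be expanded along the K2/reduction dimension.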