[TritonGEN] Add `triton_gen.cache_controls` operation #1087

victor-eds · 2024-05-10T15:08:32Z

Add triton_gen.cache_controls operation to represent SPV_INTEL_cache_controls decorations in MLIR.

This operation does not convert to any operation in a different dialect, as it supports translation straight to LLVM IR as metadata.

victor-eds · 2024-05-10T15:10:05Z

~~This is a highly experimental PR. Not ready to merge at all, as this is just a POC. Missing:~~

~~Cache control specification via special attributes instead of just i32 attribute~~
~~Check this works in the pipeline (translation interface is properly registered)~~ (should be working as the translation interface is being used)
~~Support more than one annotation by not overriding MD if present~~

whitneywhtsang · 2024-05-21T17:35:37Z

Instead of expressing cache control at the pointer, the new proposal is to express it at the memory operation (KhronosGroup/SPIRV-LLVM-Translator#2587).

Add `triton_gen.cache_control` operation to represent [SPV_INTEL_cache_controls](https://htmlpreview.github.io/?https://github.com/KhronosGroup/SPIRV-Registry/blob/main/extensions/INTEL/SPV_INTEL_cache_controls.html) decorations in MLIR. This operation does not convert to any operation in a different dialect, as it supports translation straight to LLVM IR as metadata. `triton-translate` is a new tool used to test this translation.

Signed-off-by: Victor Perez <victor.perez@codeplay.com>

victor-eds · 2024-05-29T15:05:17Z

> Instead of expressing cache control at the pointer, the new proposal is to express it at the memory operation (KhronosGroup/SPIRV-LLVM-Translator#2587).

Updated implementation. This is ready to review now.

whitneywhtsang · 2024-05-29T15:15:29Z

Given that the new design has cache control on the memory operation instead of the pointer, would it make sense for TritonGEN to add an attribute instead of a new operation that expresses the cache control on the pointer?

etiotto · 2024-05-29T15:29:42Z

> Given that the new design has cache control on the memory operation instead of the pointer, would it make sense for TritonGEN to add an attribute instead of a new operation that expresses the cache control on the pointer?

The cache control is a property of the load/store operations rather than the ptr used by those operations. So I think the attribute is a better way to go.

victor-eds · 2024-05-29T15:30:58Z

> The cache control is a property of the load/store operations rather than the ptr used by those operations. So I think the attribute is a better way to go.

My fear is that MLIR transform passes may drop these attributes, as these are not guaranteed to be preserved. That's why I went with this design.

whitneywhtsang · 2024-05-29T15:36:53Z

> My fear is that MLIR transform passes may drop these attributes, as these are not guaranteed to be preserved. That's why I went with this design.

It is the same concern for metadata on memory operations too. Do you think there is a high chance that attributes would be dropped in practice?

victor-eds · 2024-05-29T15:40:45Z

> > My fear is that MLIR transform passes may drop these attributes, as these are not guaranteed to be preserved. That's why I went with this design.
>
> It is the same concern for metadata on memory operations too. Do you think there is a high chance that attributes would be dropped in practice?

That will depend on how late in the pipeline these are added. The only way we can make sure these are added is if the cache control information is to be added as part of a conversion to LLVM dialect pattern. Otherwise, even if we have an operation foo correctly annotated and converting to llvm.store, we cannot guarantee the attributes will be carried over in the conversion to llvm.store.

etiotto · 2024-05-29T16:09:03Z

> > The cache control is a property of the load/store operations rather than the ptr used by those operations. So I think the attribute is a better way to go.
>
> My fear is that MLIR transform passes may drop these attributes, as these are not guaranteed to be preserved. That's why I went with this design.

Got it. However, the chance is small because we aren't running any transformations on the TritonGEN code.

victor-eds · 2024-05-29T16:13:00Z

> > > The cache control is a property of the load/store operations rather than the ptr used by those operations. So I think the attribute is a better way to go.
> >
> > My fear is that MLIR transform passes may drop these attributes, as these are not guaranteed to be preserved. That's why I went with this design.
>
> Got it. However, the chance is small because we aren't running any transformations on the TritonGEN code.

Yeah, I was thinking more on:

> That will depend on how late in the pipeline these are added. The only way we can make sure these are added is if the cache control information is to be added as part of a conversion to LLVM dialect pattern. Otherwise, even if we have an operation foo correctly annotated and converting to llvm.store, we cannot guarantee the attributes will be carried over in the conversion to llvm.store.

If the cache control is to be set only when creating the corresponding llvm.load/store operation, the attribute approach might work.
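
The concern above can be illustrated with a toy Python model. This is not MLIR code: `Op`, `lower_naive`, `lower_forwarding`, the `cache_control` key, and its value are all hypothetical names chosen for the sketch. It only shows that a conversion which builds a new operation keeps an attribute if, and only if, the conversion sets it explicitly.

```python
from dataclasses import dataclass, field


@dataclass
class Op:
    """Toy stand-in for an operation: a name plus an attribute dictionary."""
    name: str
    attrs: dict = field(default_factory=dict)


def lower_naive(op: Op) -> Op:
    # The conversion builds a brand-new op. Nothing copies the source op's
    # attributes unless the conversion does so itself, so they are lost here.
    return Op("llvm.store")


def lower_forwarding(op: Op) -> Op:
    # Setting the cache control while creating the llvm.store keeps it
    # attached. "cache_control" is a hypothetical attribute name.
    new_op = Op("llvm.store")
    if "cache_control" in op.attrs:
        new_op.attrs["cache_control"] = op.attrs["cache_control"]
    return new_op


# Hypothetical annotated source operation `foo`.
foo = Op("foo", {"cache_control": "L1_uncached"})
naive = lower_naive(foo)
forwarded = lower_forwarding(foo)
print(naive.attrs)
print(forwarded.attrs)
```

In this model only `lower_forwarding` preserves the annotation, which mirrors the case where the cache control is set when the `llvm.load`/`llvm.store` is created.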

victor-eds · 2024-05-30T13:51:23Z

Are we fine with the current approach or do we wanna switch to using discardable attributes in operations? @whitneywhtsang @etiotto

If this cache info is to be inserted as part of the conversion to LLVM dialect, I think we can go with the attribute design.

whitneywhtsang · 2024-05-30T13:54:54Z

> Are we fine with the current approach or do we wanna switch to using discardable attributes in operations? @whitneywhtsang @etiotto

My preference is attributes in operations.

etiotto · 2024-05-30T14:24:37Z

> My preference is attributes in operations.

The 2D block load operation in the TritonGEN dialect already has an attribute for the cache control. When we lower that operation into a call, we cannot pass the cache control as an argument because the "builtin" call we need to generate does not have that argument. Instead of taking the cache control as an argument, we need to generate metadata and attach it to the function call.
Can we do that cleanly?

victor-eds · 2024-05-30T15:50:09Z

> The 2D block load operation in the TritonGEN dialect already has an attribute for the cache control. When we lower that operation into a call, we cannot pass the cache control as an argument because the "builtin" call we need to generate does not have that argument. Instead of taking the cache control as an argument, we need to generate metadata and attach it to the function call.

With the changes I've just pushed, that'd involve attaching an attribute to the call operation as in the examples.

test/Target/LLVMIR/triton-gen.mlir

test/TritonGEN/tritongen-invalid.mlir

victor-eds requested review from jopperm, FMarno, whitneywhtsang and etiotto May 10, 2024 15:08

victor-eds self-assigned this May 10, 2024

victor-eds marked this pull request as draft May 10, 2024 15:09

whitneywhtsang linked an issue May 13, 2024 that may be closed by this pull request

Investigate visibility to translate straight to LLVM IR metadata #1117

Closed

sommerlukas mentioned this pull request May 20, 2024

Avoid post-processing generated LLVM IR #1150

Merged

victor-eds mentioned this pull request May 29, 2024

Investigate visibility to translate straight to LLVM IR metadata #1117

Closed

victor-eds marked this pull request as ready for review May 29, 2024 14:54

victor-eds force-pushed the annotated-ptr-op branch from 81bf974 to 49719dd Compare May 29, 2024 14:55

Merge branch 'llvm-target' into annotated-ptr-op

38b7413

victor-eds added 2 commits May 29, 2024 16:09

Document new operation.

aca1689

NIT

8172daa

Fail gracefully instead of erroring out

2cca769

victor-eds requested a review from a team May 29, 2024 15:32

Merge branch 'llvm-target' into annotated-ptr-op

0fbde8d

victor-eds changed the title ~~[TritonGEN] Add triton_gen.cache_control operation~~ [TritonGEN] Add triton_gen.cache_controls operation May 29, 2024

Use SmallSet

52063ec

Use attribute to represent cache controls

076bf46

Signed-off-by: Victor Perez <victor.perez@codeplay.com>

victor-eds commented May 30, 2024

test/Target/LLVMIR/triton-gen.mlir (outdated, resolved)

victor-eds and others added 2 commits May 30, 2024 17:00

Allow empty decorations

2ab1910

Merge branch 'llvm-target' into annotated-ptr-op

c673631

etiotto reviewed May 30, 2024

test/TritonGEN/tritongen-invalid.mlir (outdated, resolved)

victor-eds requested a review from etiotto May 31, 2024 14:21

Update tests

a253676

etiotto approved these changes May 31, 2024

victor-eds and others added 2 commits May 31, 2024 16:02

Update invalid test

440b16d

Merge branch 'llvm-target' into annotated-ptr-op

d0cd06a

whitneywhtsang merged commit 8ab1e45 into llvm-target Jun 1, 2024
2 checks passed

whitneywhtsang deleted the annotated-ptr-op branch June 1, 2024 01:10

whitneywhtsang mentioned this pull request Jun 4, 2024

[GEN] Use cache control attribute for 2D block read OCL lowering #1233

Closed
