Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
FindPackage(CUDA) has been deprecated and hence CUDA_VERSION is no longer set. Using CUDAToolkit_VERSION instead to determine whether or not to support CUDA graph in the TRT backend build.
To avoid silently failing to build with cuda graph support added a check on the CUDAToolkit_VERSION availability.
Related PR: triton-inference-server/backend@b5dab15
Before
NOTE WARNING: CUDA does not support CUDA graphs.
After
Background
L0_cuda_graph was failing as no cuda graph was being captured. RCA The backend was built with no cuda graph support as With this change the test is passes successfully.