[CUDA] Add cupti #128
cuda.yml
on: pull_request
linux-arm64: 33m 23s
linux-x86_64: 26m 50s
windows-x86_64: 56m 15s
redeploy: 6s