Attention projections (QKV, O) disaggregation (#1436) #3224
gpu-ci.yml
on: push
Annotations
2 errors and 1 warning
Check Python Interface
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.
Inference Tests
This request was automatically failed because there were no enabled runners online to process the request for more than 1 days.
GPU CI Concierge
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
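One way to clear this warning is to bump the checkout action to a release that targets Node.js 20. The fragment below is a minimal sketch, not the project's actual workflow: the step name and its placement inside a job are assumptions, since the contents of gpu-ci.yml are not shown here. Only the `uses:` line is the relevant change.

```yaml
# Hypothetical step inside a job in gpu-ci.yml; the surrounding job layout is assumed.
steps:
  - name: Checkout repository          # illustrative step name
    uses: actions/checkout@v4          # v4 runs on Node.js 20; v3 was built for Node.js 16
```

This change addresses only the warning. The two errors above come from no enabled runner being online, so they would need a runner to be registered and available before the jobs can start.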