Attention projections (QKV, O) disaggregation #2529
Annotations
1 warning
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3, conda-incubator/setup-miniconda@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
|
Loading