feat: Introduce K/V Context Quantisation (vRAM improvements) #4472
test.yaml
on: pull_request
generate-windows-rocm
28m 37s
generate-windows-cuda
28m 45s
Matrix: generate-cuda
Matrix: generate-rocm
Matrix: generate
Annotations
2 warnings
generate-cuda (11.8.0)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
generate-rocm (6.1.2)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/