feat: Introduce K/V Context Quantisation (vRAM improvements) #4472

Triggered via pull request September 12, 2024 20:41
Status Success
Total duration 34m 46s
Billable time 1h 44m

test.yaml

on: pull_request
changes: 4s
Matrix: lint
Matrix: test
generate-windows-rocm: 28m 37s
generate-windows-cuda: 28m 45s
Matrix: generate-cuda
Matrix: generate-rocm
Matrix: generate

Annotations

2 warnings
generate-cuda (11.8.0)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
generate-rocm (6.1.2)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/