feat: Introduce K/V Context Quantisation (vRAM improvements) #4477

Triggered via pull request September 13, 2024 03:53
Status: Success
Total duration: 34m 7s
Billable time: 1h 35m
Artifacts

test.yaml

on: pull_request
changes: 4s
Matrix: lint
Matrix: test
generate-windows-rocm: 19m 19s
generate-windows-cuda: 30m 13s
Matrix: generate-cuda
Matrix: generate-rocm
Matrix: generate

Annotations

2 warnings
generate-cuda (11.8.0)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
generate-rocm (6.1.2)
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-go@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/