Releases: yuiseki/llama.cpp
b6532
b6360
fix: resolve unsigned int initialization warning for n_dims/size in g…
b6027
HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly support only …
b5985
musa: upgrade musa sdk to rc4.2.0 (#14498)
* musa: apply mublas API changes
* musa: update musa version to 4.2.0
* musa: restore MUSA graph settings in CMakeLists.txt
* musa: disable mudnnMemcpyAsync by default
* musa: switch back to non-mudnn images
* minor changes
* musa: restore rc in docker image tag
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
b5753
opencl: ref count `ggml_backend_opencl_context` and refactor profilin…
b5743
CUDA: mul_mat_v support for batch sizes > 1 (#14262)
* CUDA: mul_mat_v support for batch sizes > 1
* use 64 bit math for initial offset calculation
b5742
kv-cells : fix tracking of seq_pos (#14339)
* kv-cells : fix tracking of seq_pos during cache reuse ggml-ci
* cont : improve error message ggml-ci
* cont : add more comments
b5734
common : use std::string_view now that we target c++17 (#14319)
b5697
ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (#14258)
b5686
common : suggest --jinja when autodetection fails (#14222)