Releases: yuiseki/llama.cpp

b6532

21 Sep 23:26
c4510dc

opencl: initial `q8_0` mv support (#15732)

b6360

02 Sep 22:59
3de0082

fix: resolve unsigned int initialization warning for n_dims/size in g…

b6027

29 Jul 23:19
aa79524

HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly support only …

b5985

25 Jul 04:09
3f4fc97

musa: upgrade musa sdk to rc4.2.0 (#14498)

* musa: apply mublas API changes

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: update musa version to 4.2.0

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: restore MUSA graph settings in CMakeLists.txt

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: disable mudnnMemcpyAsync by default

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: switch back to non-mudnn images

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* minor changes

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: restore rc in docker image tag

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b5753

25 Jun 14:23
73e53dc

opencl: ref count `ggml_backend_opencl_context` and refactor profilin…

b5743

23 Jun 12:31
defe215

CUDA: mul_mat_v support for batch sizes > 1 (#14262)

* CUDA: mul_mat_v support for batch sizes > 1

* use 64 bit math for initial offset calculation

b5742

23 Jun 11:42
7b50d58

kv-cells : fix tracking of seq_pos (#14339)

* kv-cells : fix tracking of seq_pos during cache reuse

ggml-ci

* cont : improve error message

ggml-ci

* cont : add more comments

b5734

22 Jun 12:08
40bfa04

common : use std::string_view now that we target c++17 (#14319)

b5697

18 Jun 13:26
ef03580

ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (#14258)

b5686

17 Jun 00:25
e434e69

common : suggest --jinja when autodetection fails (#14222)