Tags: xyc/llama.cpp
Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache (ggml-org#6183)

* k_cache: be able to use Q5_0
* k_cache: be able to use Q5_1 on CUDA
* k_cache: be able to use Q5_0 on Metal
* k_cache: be able to use Q5_1 on Metal
* k_cache: be able to use IQ4_NL - just CUDA for now
* k_cache: be able to use IQ4_NL on Metal
* k_cache: add newly added supported types to llama-bench and CUDA supports_op

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
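As a usage sketch, the new K-cache types plug into the existing selection mechanism: the `type_k` field of `llama_context_params`. The following is a minimal sketch assuming the llama.h C API around this change; verify the exact function and field names against the header you build with.

```cpp
// Minimal sketch: requesting a Q5_0-quantized K cache via the llama.cpp C API.
// Assumes the llama.h API of this era (llama_context_params::type_k);
// check the header for the exact names at your commit.
#include "llama.h"
#include <cstdio>

int main(int argc, char ** argv) {
    if (argc < 2) {
        fprintf(stderr, "usage: %s model.gguf\n", argv[0]);
        return 1;
    }

    llama_backend_init();

    llama_model_params mparams = llama_model_default_params();
    llama_model * model = llama_load_model_from_file(argv[1], mparams);
    if (model == NULL) {
        return 1;
    }

    llama_context_params cparams = llama_context_default_params();
    cparams.type_k = GGML_TYPE_Q5_0; // one of the newly supported K-cache types

    llama_context * ctx = llama_new_context_with_model(model, cparams);
    if (ctx == NULL) {
        llama_free_model(model);
        return 1;
    }

    // ... run inference as usual; the K cache is now stored as Q5_0 ...

    llama_free(ctx);
    llama_free_model(model);
    llama_backend_free();
    return 0;
}
```

In `llama-bench`, the same choice is exposed through the `-ctk` / `--cache-type-k` option (e.g. `-ctk q5_0`), which this commit extends to accept the new types.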
llava : add MobileVLM_V2 backup (ggml-org#6175)

* Add MobileVLM_V2 backup
* Update MobileVLM-README.md
* Update examples/llava/MobileVLM-README.md

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* Update examples/llava/convert-image-encoder-to-gguf.py

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* clip : fix whitespace
* fix definition mistake in clip.cpp

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
cuda : refactor to remove global resources (ggml-org#6170)
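The pattern behind this refactor is worth sketching: device state that used to live in file-scope globals is gathered into a context object owned by each backend instance, so separate backends no longer share mutable global state and teardown becomes deterministic. The struct and names below are illustrative only, not the actual ggml-cuda code.

```cpp
// Illustrative sketch of the refactoring pattern (hypothetical names, not
// the real ggml-cuda types): per-device handles move from globals into a
// context whose lifetime is tied to the backend instance.
#include <cuda_runtime.h>
#include <cublas_v2.h>

struct cuda_backend_ctx {
    int            device;
    cudaStream_t   stream = nullptr;
    cublasHandle_t cublas = nullptr;

    explicit cuda_backend_ctx(int dev) : device(dev) {
        cudaSetDevice(device);
        cudaStreamCreate(&stream);
        cublasCreate(&cublas);
        cublasSetStream(cublas, stream);
    }

    ~cuda_backend_ctx() {
        // release per-instance resources instead of leaking process-wide globals
        if (cublas) cublasDestroy(cublas);
        if (stream) cudaStreamDestroy(stream);
    }

    // non-copyable: each backend instance owns its handles exclusively
    cuda_backend_ctx(const cuda_backend_ctx &) = delete;
    cuda_backend_ctx & operator=(const cuda_backend_ctx &) = delete;
};
```

Tying handles to an owning object also makes multi-device and multi-context use safe, since nothing is initialized lazily behind a shared global.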
Server: version bump for httplib and json (ggml-org#6169)

* server: version bump for httplib and json
* fix build
* bring back content_length