Tags · shivamtiwari3/ollama

v0.17.8-rc4

server: remove experimental aliases support (ollama#14810)

Mar 13, 2026
f676231
zip
tar.gz

v0.17.8-rc3

ci: fix missing windows zip file (ollama#14807)

Use 7z compression (better compression ratio) if it is found in the path.  That
alone isn't sufficient to get us under 2G, so MLX is now split out as a
discrete download.  Fix CI so it fails if artifacts fail to upload.

Mar 12, 2026
a6b27d7
zip
tar.gz

v0.17.8-rc2

mlx: perf improvements (ollama#14768)

* mlx: perf improvements

Fix nn.go to call mlx_fast_layer_norm instead of implementing layer norm manually
(mean, subtract, variance, rsqrt, multiply, add: 6 ops)

Fix llama.go, gemma3.go to remove RepeatKV, which tiled K/V tensors to match the Q
head count, since scaled_dot_product_attention natively handles GQA (it just
requires n_q_heads % n_kv_heads == 0)

* review comments

Mar 12, 2026
5397411
zip
tar.gz

v0.17.8-rc1

ci: Fix windows build (ollama#14754)

Instead of relying on sh for wildcard, do it in Go for better windows
compatibility.

Mar 10, 2026
62d1f01
zip
tar.gz

v0.17.8-rc0

MLX: add header vendoring and remove go build tag (ollama#14642)

* prefer rocm v6 on windows

Avoid building with v7 - more changes are needed

* MLX: add header vendoring and remove go build tag

This switches to a vendoring approach for the mlx-c headers so that Go
can build without requiring a cmake run first.  This enables building the new
MLX-based code by default.  Every time cmake runs, the headers are refreshed, so we
can easily keep them in sync when we bump mlx versions.  Basic Windows
and Linux support is verified.

* ci: harden for flaky choco repo servers

CI sometimes fails due to choco not actually installing cache.  Since it just speeds up the build, we can proceed without.

* review comments

Mar 10, 2026
10e51c5
zip
tar.gz

v0.17.7

cmd: override stale entries for context window pi (ollama#14655)

Mar 6, 2026
9b0c7cc
zip
tar.gz

v0.17.7-rc2

cmd: override stale entries for context window pi (ollama#14655)

Mar 6, 2026
9b0c7cc
zip
tar.gz

v0.17.7-rc1

cmd/config: fix cloud model limit lookups in integrations (ollama#14650)

Mar 5, 2026
9896e36
zip
tar.gz

v0.17.7-rc0

cmd: add qwen3.5 context length for launch (ollama#14626)

Mar 4, 2026
562c76d
zip
tar.gz

v0.17.6

model: fix renderer and parser for qwen3.5 (ollama#14605)

Mar 4, 2026
82848a7
zip
tar.gz
