Tags · ArielleTolome/llmfit

v0.9.1

fix: query local providers in `recommend` CLI command

Populates `fit.installed` in CLI `recommend` output (text + JSON) by
probing Ollama, MLX, llama.cpp, Docker Model Runner, and LM Studio —
same behavior as the TUI. Honors DOCKER_MODEL_RUNNER_HOST so the DMR
backend receives requests from non-interactive CLI usage too.

Fixes docker/model-runner#747 feedback.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Apr 5, 2026
2477d1d
zip
tar.gz

v0.9.0

chore: cargo format

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Apr 5, 2026
839595d
zip
tar.gz

v0.8.9

fix: prefer discrete GPU over integrated on Windows (AlexsJones#303)

On Windows systems with both an integrated (e.g. Intel UHD) and a
discrete GPU (e.g. NVIDIA), the WMI AdapterRAM 32-bit cap could cause
the integrated GPU to report higher VRAM and win the sort, becoming
the primary GPU incorrectly.

Added `prefer_discrete_gpus` filtering that drops integrated GPUs when
at least one discrete GPU is present. On iGPU-only systems the
integrated GPU is kept as before. Integrated GPUs are identified by
name patterns (Intel UHD/HD/Iris, AMD Radeon Graphics without a
discrete model identifier).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Apr 4, 2026
3312a40
zip
tar.gz

v0.8.8

feat: add Google Gemma 4 models and fix Gemma 3 capabilities

Cherry-picked from AlexsJones#310 (credit: @shaal). Adds Gemma 4 models
(E2B-it, E4B-it, 31B-it, 26B-A4B-it), fixes MoE detection for
top_k_experts, adds any-to-any vision pipeline tag, and enables
tool_use + vision for Gemma 3/4 instruction-tuned models.

Version bump to 0.8.8.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Apr 4, 2026
2cb21c3
zip
tar.gz

v0.8.7

chore: fixed gguf filter regression

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Apr 3, 2026
ac41ec6
zip
tar.gz

v0.8.6

chore: version bump

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Apr 1, 2026
6241c8e
zip
tar.gz

v0.8.5

chore: updated

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Mar 27, 2026
27c6401
zip
tar.gz

v0.8.4

chore: updated version

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Mar 22, 2026
5f0c13b
zip
tar.gz

v0.0.2

chore: updated fmt

Signed-off-by: Alex Jones <alexsimonjones@gmail.com>

Mar 22, 2026
af99f20
zip
tar.gz

v0.8.2

fix: iGPU inflating GPU count and force-runtime being ignored (AlexsJ…

…ones#271)

rocm-smi reports all GPU agents including iGPUs on APUs like the Ryzen
9800X3D, inflating the discrete GPU count and total VRAM. Filter out
VRAM entries below 2 GB so only discrete GPUs are counted.

Also fix --force-runtime being silently ignored for pre-quantized
(AWQ/GPTQ) models by checking the override before the prequantized
default.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Mar 21, 2026
f584d7e
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.9.1

v0.9.0

v0.8.9

v0.8.8

v0.8.7

v0.8.6

v0.8.5

v0.8.4

v0.0.2

v0.8.2

Tags: ArielleTolome/llmfit