llms · llama-server run helper

Cross-platform wrapper scripts for llama-server that simplify running .gguf models with intelligent configuration management.

Scripts: llms (UNIX/WSL) and llms.ps1 (PowerShell).

Key Features

  • Settings Retention: Automatically saves each model's settings and restores them on subsequent runs.
  • Flexible Discovery: Find and run models using partial, case-insensitive names.
  • Priority-based Config: Seamlessly handles CLI arguments, environment variables, and config files.
  • Dry-run Capability: Preview commands without execution.

Quick Start

  1. Configure Model Directories: Create ~/.config/llms.ini (or %USERPROFILE%\AppData\Local\llms.ini on Windows):

    ModelsDirs = /path/to/your/models
  2. Run a Model:

    llms Mistral 64000  # First run: specify context size (saved automatically)
    llms Mistral        # Subsequent runs: settings remembered
  3. List Models: llms list

Usage Guide

Command Syntax

llms <partial_name> [<context_size>] [llama-server args...] [--dry-run]

  • Partial Name: Case-insensitive match against .gguf files.
  • Context Size: Required for first run, optional thereafter.
  • Arguments: Any llama-server flag (e.g., --mlock, --n-gpu-layers 30).
  • Dry Run: Use --dry-run to preview the command without executing or saving config.
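
For example, a first run that sets the context size, locks the model in memory, offloads 30 layers to the GPU, and only previews the resulting command (the model name and values are illustrative) might look like:

    llms qwen3 32768 --mlock --n-gpu-layers 30 --dry-run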

Multi-Modal Support

Companion .mmproj files (for multi-modal models) are detected and loaded automatically if they follow a specific naming convention.

Naming Convention: The companion file must be named {BaseName}.mmproj{Suffix}.gguf, where {BaseName} is a prefix of the main model's filename.

Example:

  • Main Model: Qwen3-VL-30B-A3B-Thinking-UD-Q4.gguf
  • Companion: Qwen3-VL-30B-A3B-Thinking-UD.mmproj-F16.gguf

The script automatically finds the companion file because Qwen3-VL-30B-A3B-Thinking-UD is a prefix of the main model's filename.
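
Assuming the wrapper forwards the companion file through llama-server's --mmproj option (paths below are illustrative, remaining flags omitted), the launched command would resemble:

    llama-server -m /models/Qwen3-VL-30B-A3B-Thinking-UD-Q4.gguf --mmproj /models/Qwen3-VL-30B-A3B-Thinking-UD.mmproj-F16.gguf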

Overriding Settings

  • Persistent: llms Mistral --n-gpu-layers 30 (saves to .ini)
  • Temporary (Server-Wide): LLMS_PORT=9090 llms Mistral (environment variable only; not saved to the .ini)
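
On PowerShell, the same temporary override can be set through the $env: scope before the call and removed afterwards (a minimal sketch; adjust the invocation to however you run llms.ps1):

    $env:LLMS_PORT = "9090"
    llms Mistral
    Remove-Item Env:LLMS_PORT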

Configuration System

Parameter Priority

  • Per-Model: CLI Args > .ini File > Default Values
  • Server-Wide: Environment Variables > llms.ini > Default Values

Note: Since version 1.3.0, both scripts ignore per-model ENV variables (e.g., LLMS_CTX_SIZE) in favor of explicit CLI and config-file settings.
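
As a concrete illustration of the per-model chain (the values are made up):

    llms Mistral 64000   # first run: 64000 is used and saved to the .ini
    llms Mistral         # no CLI value: the saved 64000 is loaded from the .ini
    llms Mistral 32000   # CLI value takes precedence over the .ini entry and is persisted in its place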

File Locations

  1. Script directory: ./llms.ini
  2. User config: ~/.config/llms.ini (UNIX) or %USERPROFILE%\AppData\Local\llms.ini (Windows)

Boolean Flags

  • Persisted (Per-Model): --mlock, --no-mmap, --jinja, --cont-batching, etc.
  • Transient (Server-Wide): --no-webui, --verbose, --dry-run, --help.
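
For example, combining one flag of each kind (behaviour as described above):

    llms Mistral --mlock --no-webui   # --mlock is written to the model's .ini entry; --no-webui applies to this run only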

Testing

Run the Pester tests for the PowerShell script:

Invoke-Pester ./llms.tests.ps1

Troubleshooting

  • "No model file found": Check ModelsDirs in llms.ini or run llms list.
  • "Specify <context_size>": Model has no saved config yet. Run once with a numeric size.
  • "llama-server not found": Ensure llama-server is in your system $PATH.
