A power-user-focused interface for LLM base models, inspired by the designs of loom, loomsidian, exoloom, logitloom, and wool.
- Some documents may cause the text editor to render token boundaries incorrectly
  - This is due to a bug in egui regarding textedit underline rendering
- Tab bars are not read by screen readers
  - This is due to a bug in egui_tiles
If you are experiencing an issue not listed here or in this repository's active issues, please file an issue so that it can be fixed.
Important
This application is a work in progress; please make backups and report any bugs that you find.
Compiled binaries can be found on the releases page.
Before using the app on macOS, you will need to run the following CLI command in the extracted folder:

```
xattr -d com.apple.quarantine tapestry*
```

Building from source requires the Rust Programming Language and a working C compiler to be installed.
```
git clone --recurse-submodules https://github.com/transkatgirl/Tapestry-Loom.git
cd Tapestry-Loom
cargo build --release
```

The compiled binary can be found in the ./target/release/ folder.
To update, run the following commands in the repository folder:

```
git pull
git submodule update --init --recursive
cargo build --release
```

See Getting Started for more information on how to use the application.
The rest of this README covers the usage of external tools which Tapestry Loom can interface with.
See migration-assistant for more information on how to migrate weaves from other Loom implementations to Tapestry Loom.
llama.cpp's llama-server is recommended, as it has been confirmed to work properly with all of the features within Tapestry Loom.
Ollama should not be used due to bad sampling settings which cannot be overridden in API requests, along with a lack of available base models.
KoboldCpp is not recommended due to a lack of request queuing and a poor implementation of logprobs (the number of requested logprobs is entirely ignored).
LM Studio is not recommended due to a lack of support for logprobs.
The recommended CLI arguments for llama-server are listed below:
```
llama-server --models-dir $MODEL_DIRECTORY --models-max 1 --sleep-idle-seconds 1200 --jinja --chat-template "message.content" --ctx-size 4096 --temp 1 --top-k 0 --top-p 1 --min-p 0
```

Where $MODEL_DIRECTORY is set to the directory where model gguf files are stored.
(Regarding quantization: Benchmarks of how chat models are affected by quantization likely do not generalize to how base models are used. Quantization should be kept as low as reasonably possible, but q8_0 is likely good enough for most use cases.)
Explanation of arguments:
- Only one model loaded into VRAM at a time; old models are automatically unloaded to make room for new ones
- Models are automatically unloaded after 20 minutes of inactivity
- The specified chat template passes user input directly to the model without further changes.
- Reducing the maximum context length helps reduce VRAM usage without sacrificing quality.
- The default sampling parameters (those specified by the CLI arguments) should leave the model's output distribution unchanged. Sampling parameter defaults for chat models do not generalize to how base models are used.
- The sampling parameters specified in the CLI arguments will be overridden by any sampling parameters specified in a request (see the example request below).
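For example, a request that sets its own sampling parameters will use those values instead of the CLI defaults above. Here is a minimal sketch using curl against a locally running llama-server; the model name, prompt, and parameter values are illustrative placeholders, not recommendations:

```
# Illustrative only: per-request values such as "temperature" and "top_p"
# take precedence over the defaults passed to llama-server on the CLI.
curl http://localhost:8080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "example-model",
    "prompt": "Once upon a time",
    "max_tokens": 32,
    "temperature": 0.8,
    "top_p": 0.95
  }'
```

(Tapestry Loom issues OpenAI-style Completions requests like this on your behalf; the sketch is only meant to show that request-level parameters take precedence over the CLI defaults.)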
Additional useful arguments (depending on your use case):
- `--no-cont-batching`: Disabling continuous batching significantly improves response determinism at the expense of performance. Should be used if you plan on analyzing logprobs or using greedy sampling.
If you are running llama-server on the same device as Tapestry Loom (and you are using the default port), you do not need to explicitly specify an endpoint URL when filling out the "OpenAI-style Completions" and "OpenAI-style ChatCompletions" templates.
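If you want to confirm that the default local endpoint is reachable before filling out the templates, you can query the server directly. This is an optional sanity check, assuming llama-server is listening on its default host and port:

```
# Optional sanity check against a local llama-server instance
# (assumes the default http://localhost:8080; adjust if you changed --host or --port).
curl http://localhost:8080/health
```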
If you are new to working with LLM base models, Trinity-Mini-Base-Pre-Anneal (or Trinity-Nano-Base-Pre-Anneal if you have <32GB of VRAM) is a good first model to try.
See tapestry-tokenize for more information on how to configure and use the (optional) tokenization server.
Once a tokenization endpoint is configured for a model, enabling the setting "(Opportunistically) reuse output token IDs" can slightly improve output quality. However, the benefit is largest when generating single-token nodes using non-ASCII characters and a single model (output token IDs cannot be reused across models).
This setting requires the inference backend to support returning token IDs (to check if this is working, hover over generated tokens in the text editor to see if they contain a token identifier). This is a non-standard addition to the OpenAI Completions API which is currently supported by very few inference backends (llama.cpp has been confirmed to work properly with this feature).
If your inference backend returns token IDs in OpenAI-style Completions responses but they do not appear in your weaves, please file an issue.
At the moment, all major features planned for the initial release have been implemented. Development will slow down for the next few months, as the focus shifts towards fixing bugs and improving documentation.
Development of the next major version of Tapestry Loom will begin in Q1 2026. Please consider donating to help fund further development.
- Support for DAG-based Weaves, similar to this unreleased loom implementation
- FIM completions
  - Selected text is used to determine FIM location
- Node copying & moving
- Perform heavy testing of data structures and/or formal verification to prevent bugs that could result in data loss
- Implement node "editing" UI (not actually editing node content, but editing the tree by adding nodes / splitting nodes / merging nodes), similar to inkstream
  - Fully immutable nodes; node splitting is implemented through duplication
    - Prefix-based duplication
- Implement counterfactual logprobs choosing, similar to loom
- FIM completions
- Embedding model support
- Node ordering by seriation
- Add a plugin API & custom inference API
  - Support the following use cases:
    - LLM research
    - Support adding custom UI elements and editor subviews
    - Autolooms (looms where node choices are picked by a user-determined algorithm)
    - Adaptive looming (node lengths are picked by a user-determined algorithm)
- Implement an optional inference server using llama.cpp
  - Support the following use cases:
    - LLM research
    - Adaptive looming using token entropy or confidence
    - Context window wrapping
    - Allow adjusting the proportion of completions from each model
      - When working with multiple models, allow dynamically adjusting proportions based on usage
      - Flatten proportion bias when increasing the number of completions; do the inverse when reducing the completion count
- Further UI improvements
- Add ability to manually control refreshing of model tokenization identifier
- Improve graph/canvas layout algorithm
- Add generate buttons (displayed on hover) to canvas
- Support arbitrary color gradients for logprob highlighting
- Blind comparison modes
  - (Hide) Models & token probabilities / boundaries
  - (Hide) Generated node text (only showing metadata & probabilities)
- Improve handling of hovered + omitted/collapsed nodes
- Better handle valid UTF-8 character split across multiple nodes
- Improve clarity of error messages
- Better file manager
- Support keyboard shortcuts for all aspects of the UI, not just the weave editor
  - Aim to support navigating the entirety of the UI without a mouse
- Improve built-in color schemes
- Node finding
- Customizable node sorting
  - Time added
  - Alphabetical
  - Semantic sorting
- Customizable node color coding
  - Probability
  - Confidence
- Node bulk selection
- Node custom ordering via drag and drop
  - Support reordering nodes in canvas and graph views as well
- Keyboard shortcut presets
  - Built-in presets
    - loomsidian-like
    - exoloom-like
    - Tapestry Loom
  - Saving & loading custom presets
  - Importing & exporting custom presets
- Support touchscreen-only devices
- Show hovered child of active node in editor, similar to exoloom
- Add ability to add custom labels to bookmarks/nodes
- Add ability to add custom attributes to nodes, rather than just bookmarks
- Weave statistical analysis tools
  - Predictability analysis using logprobs
  - Statistical analysis of various metrics (model usage, text length, logprobs, number of branches, etc.)
- Token streaming and display of nodes being generated
- Optimize for performance whenever possible
  - Aim to have acceptable performance on weaves with ~1 million nodes, ~200k active and ~10MB of active text on low-end hardware (such as a Raspberry Pi)
- Implement a special "link" node to allow splitting giant weaves into multiple documents
- Optimize memory usage to be as low as reasonably possible
  - Aim to have acceptable performance on weaves with ~1 million nodes, ~200k active and ~10MB of active text on low-end hardware (such as a Raspberry Pi)
- Add support for more weave migrations
- Support Standard Completions (after the specification is finalized)
Note: Tapestry Loom will be entirely focused on base and/or embedding models for the foreseeable future.
There are already good chat looms (such as miniloom) and base model looms which heavily integrate assistant functionality (such as helm); Tapestry Loom will not be one of them.
- Support multimodal weaves
- Support weaves of arbitrarily large size using a database-based format
- Self-contained packaging: All documentation and tools in one app, rather than being spread out over multiple
- Collaborative weave editing
  - Server-client, multi-user WebUI
- Efficiently store full edit history in weave for lossless unbounded undo/redo
- Alternate input devices
  - Talon Voice
  - Controllers / Gamepads
  - USB DDR Pads
See also: the original rewrite plans