A power user focused interface for LLM base models, inspired by the designs of loom, loomsidian, exoloom, logitloom, wool, and mikupad.
- Some documents may cause the text editor to render token boundaries incorrectly
  - This is due to a bug in egui regarding textedit underline rendering
- Tab bars are not read by screen readers
  - This is due to a bug in egui_tiles
- UI state for closed weaves persists in memory after the weave editor UI is closed, creating a slow memory leak
  - This is due to a bug in egui
- CPU usage is high when the window is not visible and not minimized
  - This is due to a bug in egui
- Editor subview tooltips take longer to close after scrolling
  - This is due to a limitation of egui
- Root nodes containing long text may overlap in the canvas view
If you are experiencing an issue not listed here or in this repository's active issues, please file an issue so that it can be fixed.
Important
This application is a work in progress; please make backups and report any bugs that you find.
Compiled binaries can be found on the releases page.
Before using the app, you will need to run the following CLI command in the extracted folder:
```
xattr -d com.apple.quarantine tapestry*
```

Building from source requires the Rust Programming Language and a working C compiler to be installed.
```
git clone --recurse-submodules https://github.com/transkatgirl/Tapestry-Loom.git
cd Tapestry-Loom
cargo build --release
```

The compiled binary can be found in the `./target/release/` folder.
Run the following commands in the repository folder:
```
git pull
git submodule update --init --recursive
cargo build --release
```

See Getting Started for more information on how to use the application.
The rest of this README covers the usage of external tools which Tapestry Loom can interface with.
See migration-assistant for more information on how to migrate weaves from other Loom implementations to Tapestry Loom.
llama.cpp's llama-server is recommended, as it has been confirmed to work properly with all of the features within Tapestry Loom (except returning prompt logprobs).
vLLM requires additional request arguments to work properly with Tapestry Loom:
- `/v1/completions`
  - `return_token_ids=true` - Optional; allows (partial) reuse of output token IDs when using Tapestry Tokenize. However, (unlike llama.cpp) token IDs are only returned for the selected token, not for all `top_logprobs`.
    - Must be removed when using `echo=true`
- `/v1/chat/completions`
  - `return_token_ids=true` - Optional; allows (partial) reuse of output token IDs when using Tapestry Tokenize. However, (unlike llama.cpp) token IDs are only returned for the selected token, not for all `top_logprobs`.
  - `continue_final_message=true`
  - `add_generation_prompt=false`
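As a rough illustration of where these arguments fit, the request sketches below assume a local vLLM server on its default port; the model name, prompt, and token counts are placeholders. Tapestry Loom builds these requests itself, so this only shows where the extra fields go.

```
# Completion request with logprobs plus vLLM's return_token_ids argument.
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "my-base-model",
    "prompt": "Once upon a time",
    "max_tokens": 32,
    "logprobs": 5,
    "return_token_ids": true
  }'
# Drop "return_token_ids" from the body if the request also sets "echo": true.

# Chat-style request; continue_final_message/add_generation_prompt make vLLM
# continue the final assistant message instead of appending a new turn.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "my-base-model",
    "messages": [
      {"role": "user", "content": "Write a short story."},
      {"role": "assistant", "content": "Once upon a time"}
    ],
    "max_tokens": 32,
    "return_token_ids": true,
    "continue_final_message": true,
    "add_generation_prompt": false
  }'
```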
Ollama should not be used, due to poor default sampling settings that cannot be overridden in API requests and a lack of available base models.
KoboldCpp is not recommended due to a lack of request queuing and a poor implementation of logprobs (the number of requested logprobs is entirely ignored).
LM Studio is not recommended due to a lack of support for logprobs.
The recommended CLI arguments for llama-server are listed below:
```
llama-server --embeddings --models-dir $MODEL_DIRECTORY --models-max 1 --sleep-idle-seconds 1200 --jinja --chat-template "message.content" --ctx-size 4096 --temp 1 --top-k 0 --top-p 1 --min-p 0
```

Where `$MODEL_DIRECTORY` is set to the directory where model gguf files are stored.
(Regarding quantization: Benchmarks of how chat models are affected by quantization likely do not generalize to how base models are used. Quantize models as little as reasonably possible, but q8_0 is likely good enough for most use cases.)
Explanation of arguments:
- Only one model is loaded into VRAM at a time; old models are automatically unloaded to make room for new ones
  - If you plan on using an embedding model, you should start a second server instance to avoid swapping out your text generation model when generating embeddings (see the example below)
- Models are automatically unloaded after 20 minutes of inactivity
- The specified chat template passes user input directly to the model without further changes.
- Reducing the maximum context length helps reduce VRAM usage without sacrificing quality.
- The default sampling parameters (those specified by the CLI arguments) should leave the model's output distribution unchanged. Sampling parameter defaults for chat models do not generalize to how base models are used.
  - The sampling parameters specified in the CLI arguments will be overridden by any sampling parameters that are specified in a request.
Additional useful arguments (depending on your use case):
- `--no-cont-batching` - Disabling continuous batching significantly improves response determinism at the expense of performance. Should be used if you plan on analyzing logprobs or using greedy sampling.
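Regarding the embedding-model note above: a second llama-server instance dedicated to embeddings only needs the embedding model and its own port. A minimal sketch (the port and gguf file name are placeholders):

```
# Second llama-server instance for embeddings, on its own port, so embedding
# requests never swap out the text generation model.
llama-server --embeddings --port 8081 -m $MODEL_DIRECTORY/embeddinggemma-300m-q8_0.gguf
```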
If you are running llama-server on the same device as Tapestry Loom (and you are using the default port), you do not need to explicitly specify an endpoint URL when filling out the "OpenAI-style Completions" and "OpenAI-style ChatCompletions" templates.
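If llama-server runs on another machine or a non-default port, the endpoint URL does need to be filled in. A hedged sketch using llama-server's OpenAI-compatible paths (the host and port are placeholders; the templates may only expect the base URL, so check their prefilled format):

```
# llama-server on another machine, listening on a non-default port
# (9090 is an arbitrary example; add the recommended arguments from above as well).
llama-server --port 9090 --models-dir $MODEL_DIRECTORY --models-max 1

# Corresponding endpoint URLs to enter when filling out the templates:
#   OpenAI-style Completions:      http://<server-address>:9090/v1/completions
#   OpenAI-style ChatCompletions:  http://<server-address>:9090/v1/chat/completions
```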
If you are new to working with LLM base models, Trinity-Mini-Base-Pre-Anneal (or Trinity-Nano-Base-Pre-Anneal if you have <32GB of VRAM) is a good first model to try.
If you plan on using seriation, embeddinggemma-300m is a good small embedding model.
Most inference providers support OpenAI-compatible clients and should work with minimal configuration.
However, every inference provider implements OpenAI compatibility in their own way, which may cause unexpected issues. Known issues with popular inference providers are listed below:
- OpenRouter
  - Logprobs are not supported, even if the underlying provider supports them
  - In addition, some providers on OpenRouter will return errors if `logprobs` is included as a request argument
- Featherless
  - Untested; logprobs are not supported according to documentation
See tapestry-tokenize for more information on how to configure and use the (optional) tokenization server.
Once a tokenization endpoint is configured for a model, enabling the setting "(Opportunistically) reuse output token IDs" can slightly improve output quality by giving you more control over tokenization. In this mode, model output token IDs are reused whenever possible and tokenization is performed per-node.
However, the benefit of reusing the model's output tokenization is greatest when generating single-token nodes using non-ASCII characters and a single model (output token IDs cannot be reused across models).
This setting requires the inference backend to support returning token IDs (to check if this is working, hover over generated tokens in the text editor to see if they contain a token identifier). This is a non-standard addition to the OpenAI Completions API which is currently supported by very few inference backends (llama.cpp has been confirmed to work properly with this feature).
If your inference backend returns token IDs in OpenAI-style Completions responses but they do not appear in your weaves, please file an issue.
Please consider donating to help fund further development.
- Take inspiration from multiverse
- Take inspiration from mikupad
- Do more experimentation with loom's UI in order to find UX improvements which may be worth including in Tapestry Loom
Goal: Completion before April 1st, 2026
- Add in-app dialogs to display information about breaking changes
- Implement v1 format
- Store generation seed in node
- Add support for custom randomness sources
- Enable counterfactual logprobs by default
- Implement highlighting of active counterfactual token
- Implement a special "link" node to allow splitting giant weaves into multiple documents
- Refactor migration-assistant to natively use v1 format
- Improve handling of hovered + omitted/collapsed nodes
- Release version 0.14.0
- Implement DAG-based Weaves, similar to this unreleased loom implementation
- FIM completions
  - Selected text is used to determine FIM location
- Diff-based editor content application
- Add mode to reuse model output nodes when updating tree whenever possible
- Implement node "editing" UI (not actually editing node content, but editing the tree by adding nodes / splitting nodes / merging nodes), similar to inkstream
- Allow the user to create connections between arbitrary nodes
- Implement functionality for bookmarking specific paths within a weave
- Add documentation to the `tapestry-weave` library
- Improve Weave saving & loading
- Initially load weaves using zero-copy deserialization, performing full deserialization in the background
- Perform weave saving in the background without visual glitches
- Support read-only weave editors using zero-copy deserialization and file memory mapping
- Show hovered child of active node in editor, similar to exoloom
- Add "autoloom" setting where clicking a node generates children, similar to inkstream
- Implement additional sorting methods
- Average token probability
- Cumulative node probability
- Alphabetical (content)
- Length (content)
- UI improvements
- Automatically calculate when to display "show more" based on available screen space
- Add content copying to node context menu
- Add setting to swap shift-click and normal click behavior
- Add sorting submenu to node context menu
- Add right click handling to node list background
- Better handle valid UTF-8 characters split across multiple nodes
- Add keyboard shortcuts for selecting specific child nodes
- Request post-processing arguments (using prefix of `TL#`)
  - Single-token node pruning:
    - `TL#keep_top_p`
    - `TL#keep_top_k`
    - `TL#prune_empty`
  - Node pruning:
    - `TL#node_min_conf`
    - `TL#node_max_conf`
    - `TL#node_min_avg_p`
    - `TL#node_max_avg_p`
    - `TL#prune_empty`
  - Basic adaptive looming:
    - `TL#min_tokens`
    - `TL#p_threshold`
    - `TL#conf_threshold`
  - Force single token node creation using `TL#force_single_token`
  - Context window wrapping using `TL#ctx_length`
  - Add setting to toggle request post-processing
- Implement BERT FIM server using a nonstandard `fim_tokens` parameter
- Update Getting Started document
- Add node finding
- Support arbitrary color gradients for logprob highlighting
- Add OKLCH-based color picker
- Add blind comparison modes
- (Hide) Models & token probabilities / boundaries
- (Hide) Generated node text (only showing metadata & probabilities)
- Add weave statistical analysis tools
- Add customizable node color coding
- Probability
- Confidence
- Allow changing default editor subview layout
- Perform UX testing with all built-in color schemes
- Review and refactor application modules
- settings
- editor
- Optimize performance whenever reasonably possible
- Support opening weaves using CLI arguments to Tapestry Loom
- Review and refactor main module
Goal: Completion before June 1st, 2026
- Allow temporarily overriding color in inference menu
- Add bookmark functionality to files view
- Add support for more weave migrations
- Add model configuration sharing functionality
- Automatically redact sensitive information (such as API keys)
- Allow the user to manually redact sensitive information
- Add ability to manually control refreshing of model tokenization identifier
- Add support for response streaming
- Improve API response building
- Add support for OpenAI Responses
- Add support for Anthropic Complete
- Add support for Anthropic Messages
- Add support for Gemini generateText
- Add support for Gemini generateContent
- Add support for Gemini embedContent
- Review and refactor settings/inference module
- Improve clarity of error messages
- Perform API client testing with commonly used inference backends
- llama-cpp
- ollama
- vllm
- sglang
- tensorrt-llm
- text-generation-inference
- text-embeddings-inference
- koboldcpp
- lm-studio
- litellm
- Perform API client testing with less commonly used inference backends
- lemonade
- infinity
- swama
- exllamav2
- lmdeploy
- mlc-llm
- shimmy
- Package Tapestry Loom with an icon and application metadata
- Create video-based documentation
- Create a website for Tapestry Loom's downloads and documentation
- Perform heavy unit testing and/or formal verification of `universal-weave` to prevent bugs that could result in data loss
- Release `universal-weave` version 1.0.0
- Write unit tests for response parser
- Release Tapestry Loom version 1.0.0-rc.1
The below items may be implemented in a 1.x release, or they may be delayed to be implemented in a future 2.x release.
- Token healing
- Instruct templating, similar to mikupad
- Do testing with llamafile for easier onboarding?
- Prefix-based deduplication
- Improve graph/canvas layout algorithm
- Improve file manager
- Support keyboard shortcuts for all aspects of the UI, not just the weave editor
- Aim to support navigating the entirety of the UI without a mouse
- Improve built-in color schemes
- Node bulk selection
- Node custom ordering via drag and drop in all views
- Keyboard shortcut presets
  - Built-in presets
    - loomsidian-like
    - exoloom-like
    - Tapestry Loom
  - Saving & loading custom presets
  - Importing & exporting custom presets
- Support touchscreen-only devices
- Add ability to add custom labels to bookmarks/nodes
- Add ability to add custom attributes to nodes, rather than just bookmarks
- Collaborative weave editing
- Interfaces for AI agents to use Tapestry Loom
- Efficiently store full edit history in weave for lossless unbounded undo/redo
- Human as token-predictor mode
- "The LLM looms you"
- WASM version of Tapestry Loom
- Support multimodal weaves
- Support weaves of arbitrarily large size using a database-based format
- Self-contained packaging: All documentation and tools in one app, rather than being spread out over multiple
- Server-client, multi-user WebUI
- Alternate input devices
- Talon Voice
- Controllers / Gamepads
- USB DDR Pads
See also: the original rewrite plans