Implement MinLZ Stream Search Tables #35
Open
klauspost wants to merge 26 commits into
Implements - and includes minio#31

Allows for text search inside compressed data.

# Block Search Tables

## What it does

MinLZ streams can include optional per-block bloom filter tables that enable searching compressed data without decompressing every block. Blocks that definitely don't contain the search pattern are skipped entirely via `io.Seeker` (single syscall) or buffered read.

## Encoding

```go
cfg := minlz.NewSearchTableConfig()    // matchLen defaults to 6
cfg = cfg.WithBytePrefix('"', ':')     // optional: only index after these bytes
w := minlz.NewWriter(output, minlz.WriterSearchTable(cfg))
```

- `NewSearchTableConfig()` -- no arguments, defaults to matchLen=6
- `WithMatchLen(n)` -- override match length (1-8)
- `WithBytePrefix(bytes...)` -- type 2 (<=8 bytes) or auto-promotes to type 3 (bitmask) for >8
- `WithMaskPrefix(mask)` -- type 3 (256-bit bitmask)
- `WithLongPrefix(prefix)` -- type 4 (multi-byte prefix, 1-256 bytes)
- `WithMaxPopulation(pct)` -- skip table if >N% bits set (default 70)
- `WithMaxConflicts(pct)` -- stop reducing at >N% conflicts (default 25)

Tables are generated in compression goroutines with zero extra channel overhead (single result per block). Only compressible blocks get tables. Empty tables (prefix not found in block) are reduced to a minimum of 32 bytes -- still useful for proving absence. Base table size = 1 bit per uncompressed byte, automatically derived from block size. Per-block reductions adapt based on population/conflict thresholds.

## Searching

```go
searcher := minlz.NewBlockSearcher(input)
err := searcher.Search([]byte("pattern"), func(r minlz.SearchResult) bool {
	// r.Data = decoded block, r.BlockStart = uncompressed offset
	// r.PrevBlock() = previous decoded block (nil if skipped/first)
	return true // continue
})
stats := searcher.Stats()
stats.Fprint(os.Stderr)
```

- Uses `io.Seeker` when available for O(1) block skipping
- For prefix tables, scans the search pattern for prefix bytes **anywhere inside it** -- `stamp":"1679` works with prefix `"` because `"` appears at position 5
- Empty prefix tables (all zeros) correctly skip blocks where the prefix byte never appears
- Falls back to full decode when tables are absent or incompatible
- `BlockSearchBailOnMissing()` to error instead of falling back
- `SearchResult` struct with `PrevBlock()` for boundary matching

## Stats

```
Blocks total: 2505, skipped: 2331, searched: 174
Skip rate: 93.1%
Table bits/byte: 0.0041, log2: 8.1, avg reductions: 13.9
Table total: 5368528 bytes, avg 2143 bytes/table, 0.05% of 10506623721 uncompressed
Table population: avg 21.3%, min 0.0%, max 46.9%
```

## CLI (mz)

```
mz c -search=8 -search.prefixes='":' file.log   # byte prefixes
mz c -search=4 -search.prefix='id:"' file.log   # long prefix (single byte auto-promotes)
mz search -v "pattern" file.log.mz              # search with stats
mz search -l -n "pattern" file.log.mz           # line mode with line numbers
mz search -c "pattern" file.log.mz              # count only
mz search -q "pattern" file.log.mz              # exit code only
```

## Performance (10GB cockroach log, Ryzen 9 9950X)

| Config | Skip rate | Search time | Throughput | Table overhead |
|--------|-----------|-------------|------------|----------------|
| matchLen=8, prefix `":` | 99.9% | 144ms | 55 GB/s | 0.05% |
| matchLen=4, prefix `":` | 93.1% | 191ms | 44 GB/s | 0.05% |
| matchLen=4, prefix absent | 100% | 17ms | 618 GB/s | 0.001% |
| matchLen=8, no prefix | 99.9% | 144ms | 55 GB/s | 2.1% |

**Indexing**: ~1.2 GB/s no-prefix, ~2.1 GB/s with prefix (single core)
**Pattern lookup**: 37ns, zero allocations

## Wire format

Two new skippable chunk types (backward compatible -- old readers silently skip them):

- `0x44` -- Search table info (per-stream): table type, match length, base size, prefixes
- `0x45` -- Block search table (per-block): reductions + bit array

Spec: [SPEC_SEARCH.md](SPEC_SEARCH.md) with Appendix A on searcher lookup strategies.
Force-pushed from `3b116f4` to `540b4e9`
harshavardhana (Collaborator) approved these changes on Mar 27, 2026

klauspost (Author): (still fine tuning)
# Stream Searching

## Introduction
MinLZ streams can include optional per-block hash tables that allow searching
compressed data without decompressing every block. When a block's table indicates the
search pattern is definitely absent, the block is skipped entirely — via
`io.Seeker` if available (single syscall), or by buffering past the compressed data.
Streams with search tables remain fully backward-compatible: readers that don't
understand the table chunks silently skip them, and the compressed data is unchanged.
Search tables are generated during compression with zero impact on the compressed data
itself — they are stored as additional skippable chunks interleaved between blocks.
For the format specification, see [SPEC_SEARCH.md](SPEC_SEARCH.md).
## How It Works

Each block's search table is a bit array: every position in the uncompressed data
is hashed and the corresponding bit is set. When searching, the pattern's byte windows
are hashed and checked against the table. If any window's bit is unset, the pattern
cannot be present in the block.
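To make the mechanism concrete, here is a minimal, self-contained sketch of the idea. It is not the MinLZ implementation; `hashWindow`, `buildTable`, `mightContain`, the FNV-1a hash, and the constants are all illustrative choices.

```go
package main

import "fmt"

const (
	matchLen  = 4       // bytes hashed per window (illustrative)
	tableBits = 1 << 12 // bit array size, a power of two (illustrative)
)

// hashWindow hashes the first matchLen bytes of p to a bit index (FNV-1a).
func hashWindow(p []byte) uint32 {
	var h uint32 = 2166136261
	for _, b := range p[:matchLen] {
		h = (h ^ uint32(b)) * 16777619
	}
	return h & (tableBits - 1)
}

// buildTable sets one bit per byte window of the uncompressed block.
func buildTable(block []byte) []uint64 {
	table := make([]uint64, tableBits/64)
	for i := 0; i+matchLen <= len(block); i++ {
		bit := hashWindow(block[i:])
		table[bit/64] |= 1 << (bit % 64)
	}
	return table
}

// mightContain reports false only when the pattern is definitely absent:
// every window of the pattern must have its bit set in the table.
func mightContain(table []uint64, pattern []byte) bool {
	for i := 0; i+matchLen <= len(pattern); i++ {
		bit := hashWindow(pattern[i:])
		if table[bit/64]&(1<<(bit%64)) == 0 {
			return false // definite miss: the block can be skipped
		}
	}
	return true // possible hit: the block must be decoded
}

func main() {
	block := []byte(`{"level":"warn","msg":"disk full"}`)
	table := buildTable(block)
	fmt.Println(mightContain(table, []byte(`"warn"`))) // true: present patterns never miss
	// An absent pattern is almost certainly reported false with a sparse table,
	// but a false positive is possible by design.
	fmt.Println(mightContain(table, []byte(`kubernetes`)))
}
```

Note the asymmetry: a `false` result is a proof of absence, while `true` only means the block must be decoded and searched.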
Tables are reduced (halved) by OR-folding the upper and lower halves, trading accuracy
for size. Reductions are applied per-block based on population density.
Longer search patterns produce more window checks, giving exponentially better
filtering. For example, a 19-byte pattern with matchLen=8 produces 12 window
checks — all 12 must match for a false positive, which is extremely unlikely
with typical table populations of 10–30%.
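The arithmetic behind that example can be sketched quickly (hypothetical `windows` helper, not library code):

```go
package main

import (
	"fmt"
	"math"
)

// windows returns the number of hash windows a pattern of length n
// produces with a given match length: n - matchLen + 1.
func windows(n, matchLen int) int {
	return n - matchLen + 1
}

func main() {
	w := windows(19, 8) // the 19-byte pattern with matchLen=8
	fmt.Println(w)      // 12 window checks

	// If p is the fraction of bits set in the table, the chance that an
	// absent pattern still hits set bits in all windows is roughly p^w.
	p := 0.30
	fmt.Printf("%.2g\n", math.Pow(p, float64(w))) // about 5.3e-07
}
```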
Table sizes are limited to 1 bit per uncompressed byte. This means the tables will be
at most 1/8th of the uncompressed stream size. These base tables also have a maximum
population count, defaulting to 70%: if the 8:1 table is filled beyond this threshold,
it is not saved to the stream.
This means that blocks with near-random data will not have any tables,
and searching will have to fall back to decompression.
MinLZ will not attempt to generate tables for incompressible blocks.
## Parameters

### Match Length

Controls how many bytes at each position are hashed into the table. Range: 1–8,
default: 6.

A smaller match length allows the table to be used with
shorter search patterns. However, shorter hashes collide more, increasing table
population.
On the other hand, a search pattern of length N only produces
`N - matchLen + 1` hash windows
to check. Fewer windows means fewer independent chances to prove a block doesn't
contain the pattern, which can reduce skip rates. Higher match lengths also produce
lower base population, which means fewer reductions and larger tables on disk.
The match length must be less than or equal to the search pattern length. Patterns
shorter than the match length cannot use the table (the searcher falls back to full
decode).
A good default is 6: it balances table density against the number of check windows.
Use 4 for short patterns (e.g. short IDs), but be aware that short windows from common
character classes (digits, hex, lowercase) will appear in nearly every block, collapsing
skip rates. For example, searching numeric data with matchLen=4 can drop skip rates to
single digits because 4-byte digit sequences are ubiquitous.
### Table Max Population
Maximum percentage of bits that may be set in the base table before it is discarded
entirely. Default: 70%.
When a block's data is highly random or the match length is short, most hash slots get
filled and the table loses its ability to prove absence. Tables exceeding this threshold
are dropped — the block will always be decoded during search.
Lowering this value makes the compressor more aggressive about discarding noisy tables,
reducing overhead at the cost of fewer indexed blocks. Raising it keeps more tables but
with higher false-positive rates.
### Table Reduction Limit

Maximum population percentage of the reduced table. Library default: 25%.
The `mz` CLI defaults to 50% without prefix and 25% with prefix.

After the base table is built, it is iteratively halved by folding (OR-ing) the upper
half into the lower half. Each reduction halves the table size but increases the
population density. Reductions stop before this threshold is exceeded.
Lower values produce smaller tables (fewer bytes per block in the compressed stream) but
with more false positives. Higher values keep larger, sparser tables that skip more blocks.
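The fold itself is simple to sketch. The `reduce` and `population` helpers below are hypothetical names illustrating the idea, not the MinLZ code:

```go
package main

import "fmt"

// reduce OR-folds the upper half of a bit table into the lower half,
// halving its size. A bit originally at index i is afterwards probed
// at i modulo the new bit count, so lookups stay valid but collide more.
func reduce(table []uint64) []uint64 {
	half := len(table) / 2
	out := make([]uint64, half)
	for i := range out {
		out[i] = table[i] | table[half+i]
	}
	return out
}

// population counts set bits; the compressor stops reducing before the
// population threshold would be exceeded.
func population(table []uint64) int {
	n := 0
	for _, w := range table {
		for ; w != 0; w &= w - 1 {
			n++
		}
	}
	return n
}

func main() {
	table := []uint64{0b0001, 0b0000, 0b0100, 0b1000}
	smaller := reduce(table)
	fmt.Println(smaller, population(smaller)) // [5 8] 3
}
```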
### Prefixes

Prefix filtering dramatically reduces table size for structured data by only indexing
positions that follow specific bytes. For example, in JSON data, values always follow
`"` or `:`, so most byte positions can be skipped during indexing.

Since fewer positions are indexed, the base table population is much lower, allowing
more reductions and producing significantly smaller tables on disk. The downside is that
search patterns must contain at least one prefix byte (or match the long prefix) for
the table to be usable. Patterns without any prefix bytes fall back to full block decode.
There are two prefix modes:

#### Single byte

Single-byte prefixes index only positions preceded by one of the given bytes.
`WithBytePrefix` accepts up to 8 bytes directly; for more than 8, use `WithMaskPrefix`
with a 256-bit bitmask. Both produce the same result.
#### Long prefix

A long prefix indexes only positions preceded by an exact multi-byte sequence (1–256 bytes).

The searcher scans the search pattern for prefix bytes anywhere inside it. For example,
searching for `"unique-9876"` with byte prefix `"` works because `"` appears at position 0
in the pattern. The table is consulted for the hash window that follows each prefix
occurrence in the pattern.

When no prefix bytes appear in the search pattern, the table cannot be used and the
searcher falls back to full block decode.
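That scan can be sketched as follows, assuming a hypothetical `prefixWindows` helper (the real searcher's internals may differ):

```go
package main

import "fmt"

// prefixWindows returns, for every prefix byte found anywhere in the
// pattern, the matchLen-byte hash window that follows it. Each returned
// window is then checked against the block's table.
func prefixWindows(pattern []byte, prefixes map[byte]bool, matchLen int) [][]byte {
	var wins [][]byte
	for i, b := range pattern {
		if prefixes[b] && i+1+matchLen <= len(pattern) {
			wins = append(wins, pattern[i+1:i+1+matchLen])
		}
	}
	return wins
}

func main() {
	// The prefix byte `"` occurs at positions 5 and 7 of the pattern,
	// so two windows are checked against the table.
	for _, w := range prefixWindows([]byte(`stamp":"1679`), map[byte]bool{'"': true}, 4) {
		fmt.Printf("%q\n", w)
	}
}
```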
#### Choosing good prefix bytes

Pick bytes that immediately precede the values you'll search for:

- `"` and `:` — values always follow `":` or `:[`
- `,` or `\t` — field separators
- `=` — precedes values in `key=value` data

## Command line
The `mz` tool supports search table generation during compression and pattern search
on compressed files.
Compression with search tables:
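These flags are taken from the PR summary; `-search` sets the match length and `-search.prefixes`/`-search.prefix` set byte or long prefixes:

```shell
mz c -search=8 -search.prefixes='":' file.log   # byte prefixes
mz c -search=4 -search.prefix='id:"' file.log   # long prefix (single byte auto-promotes)
```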
Searching compressed files:
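The search subcommand and its flags, as listed in the PR summary:

```shell
mz search -v "pattern" file.log.mz      # search with stats
mz search -l -n "pattern" file.log.mz   # line mode with line numbers
mz search -c "pattern" file.log.mz      # count only
mz search -q "pattern" file.log.mz      # exit code only
```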
When tables are absent, the searcher decodes every block (equivalent to decompress + grep).
With tables present, only blocks that might contain the pattern are decoded.
## API reference

### Compression

Enable search tables by passing `WriterSearchTable` as a writer option.

Tables are generated concurrently alongside compression with no extra goroutine
synchronization overhead. The `Writer` handles all table generation and chunk
serialization automatically.
Configuration methods:

- `NewSearchTableConfig()`
- `WithMatchLen(n)`
- `WithBytePrefix(b...)`
- `WithMaskPrefix(mask)`
- `WithLongPrefix(p)`
- `WithMaxPopulation(pct)`
- `WithMaxReducedPopulation(pct)`

Decompressing the stream will ignore the search tables.
### Searching

Search compressed streams using `BlockSearcher`. The callback receives a
`SearchResult` for each match:

- `Blocks [2][]byte` — `[0]` = previous block (nil if skipped/lazy), `[1]` = current block
- `Offset int` — offset of the match within `PrevBlock()` + `Blocks[1]`
- `StreamOffset int64`
- `BlockStart int64`
- `PrevBlockLen int` — length of the `PrevBlock()` data

Methods on `SearchResult`:

- `PrevBlock() []byte` — returns the previous block's data. Lazily decompresses if the
  previous block was skipped by the index. Returns nil if no previous block exists.
Return values from the callback:

- `nil` — continue searching
- `ErrSearchForward` — request the next block for forward context; the searcher will
  re-call the callback with the same match but `Blocks[1]` replaced by the next block

Searcher options:

- `BlockSearchBailOnMissing()` — return `ErrSearchTablesUnusable` if tables are absent or incompatible
- `BlockSearchIgnoreCRC()`
- `BlockSearchMaxBlockSize(n)`

After `Search` returns, call `Stats()` for a `SearchStats` struct with block counts,
skip rates, table population metrics, and byte-level statistics. Use
`stats.Fprint(os.Stderr)` for a human-readable summary.

Note that the maximum backreference on matches is limited by the block size.
So a match right after a block boundary will only have the previous block's data available.
Spec: SPEC_SEARCH.md with Appendix A on searcher lookup strategies.