
Conversation

@adonovan (Contributor) commented Dec 10, 2025

Previously, the lexer ran as a separate goroutine that fed a channel with items. This change expresses the lexer as an iter.Seq[item], and then uses the iter.Pull mechanism to run it as a lightweight coroutine. This reduces the running time of the added benchmark by about 32%.

$ go test -benchtime=10s -bench=. ./pattern

before
BenchmarkParser-8    649060    19051 ns/op
after
BenchmarkParser-8    908734    12971 ns/op

Interestingly, the total time spent by gopls in Parse, measured by bracketing Parse with time.Now/time.Time.Sub calls, consistently drops by a factor of around 20 (down to about 800µs from the original 15ms). I can't yet explain this discrepancy.
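
For readers less familiar with the Go 1.23 iterator API, here is a minimal, self-contained sketch of the shape of the change, with illustrative names (the real item type and lexer in the pattern package are more involved): the lexer is written as a push-style iter.Seq[item], and iter.Pull converts it into the pull-style next/stop pair the parser consumes, replacing the goroutine-plus-channel handoff.

```go
package main

import (
	"fmt"
	"iter"
)

// item is a stand-in for the lexer's token type.
type item struct {
	typ string
	val string
}

// lex expresses the lexer as an iterator over items. The body only runs
// when the consumer asks for the next item; returning false from yield
// means the consumer stopped early.
func lex(input string) iter.Seq[item] {
	return func(yield func(item) bool) {
		for _, r := range input {
			if !yield(item{typ: "rune", val: string(r)}) {
				return
			}
		}
	}
}

func main() {
	// iter.Pull runs the iterator as a lightweight coroutine and hands
	// back a pull-style interface, much like the old channel receive.
	next, stop := iter.Pull(lex("ab"))
	defer stop()
	for {
		it, ok := next()
		if !ok {
			break
		}
		fmt.Println(it)
	}
}
```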

@dominikh (Owner) commented

Those gopls timings are quite worrying...

As for the change itself, maybe we should just have the lexer return a slice of tokens? Patterns are small, we only parse them at program startup, and the [cg]oroutines probably don't save enough garbage to justify the added complexity.
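
For concreteness, the slice-returning alternative might look roughly like this, reusing the illustrative item type from the sketch above (not the actual pattern package code):

```go
// lexAll lexes the whole input eagerly and returns the tokens as a slice;
// the parser then just walks the slice with an index instead of pulling
// from a coroutine.
func lexAll(input string) []item {
	var items []item
	for _, r := range input {
		items = append(items, item{typ: "rune", val: string(r)})
	}
	return items
}
```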

@adonovan (Contributor, Author) commented

> Those gopls timings are quite worrying...

> As for the change itself, maybe we should just have the lexer return a slice of tokens? Patterns are small, we only parse them at program startup,

That's a possibility. Especially if the tokens themselves could be made smaller by not including the string, only its start/end offsets into the content string shared by lexer and parser, something like:

type item struct {
	typ        uint8
	start, end int32
}

> and the [cg]oroutines probably don't save enough garbage to justify the added complexity.

I felt the coroutines actually reduced complexity (a small amount) by recasting the lexer as just the implementation detail of an iterator; and the parser's nextToken function exactly matches what iter.Pull gives us.
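
Concretely, and still using the illustrative lex and item from the earlier sketch rather than the real parser code, the correspondence is roughly:

```go
// parser holds the pull-style interface returned by iter.Pull.
type parser struct {
	next func() (item, bool) // pull the next token from the lexer coroutine
	stop func()              // release the coroutine when parsing is done
}

func newParser(input string) *parser {
	next, stop := iter.Pull(lex(input))
	return &parser{next: next, stop: stop}
}

// nextToken is just the pull function under another name.
func (p *parser) nextToken() (item, bool) {
	return p.next()
}
```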
