Parse input_audio message content by tamnd · Pull Request #47 · tamnd/gomlx

tamnd · 2026-06-04T10:14:52Z

What

Audio reaches a chat message the same way images do: as a typed content part.
The OpenAI format carries it inline as base64 under an input_audio part with a
format field, rather than by URL.
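
As an illustration of the wire shape described above, here is a minimal sketch of decoding a content array that contains an input_audio part. The Go type and field names (ContentPart, InputAudio, Data, Format) and the helper parseParts are assumptions made for this example, not the code from this PR; only the JSON keys follow the OpenAI format.

```go
package main

import "encoding/json"

// InputAudio mirrors the OpenAI input_audio payload: base64 audio plus a
// format tag such as "wav" or "mp3". Field names are assumed for this sketch.
type InputAudio struct {
	Data   string `json:"data"`
	Format string `json:"format"`
}

// ContentPart is one typed element of a message's content array.
// Only the fields needed for this example are shown.
type ContentPart struct {
	Type       string      `json:"type"`
	Text       string      `json:"text,omitempty"`
	InputAudio *InputAudio `json:"input_audio,omitempty"`
}

// parseParts is a hypothetical helper that decodes a JSON content array
// into typed parts. The base64 payload is left untouched at this stage.
func parseParts(raw []byte) ([]ContentPart, error) {
	var parts []ContentPart
	if err := json.Unmarshal(raw, &parts); err != nil {
		return nil, err
	}
	return parts, nil
}
```

Keeping InputAudio as a pointer lets a caller tell a part with no audio payload apart from one whose payload is present but empty.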

InputAudio is added to ContentPart.
Message.AudioRefs decodes each clip's base64 into bytes paired with its
format.
Message.HasAudio is the cheap check for whether a message carries audio.
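
The three additions above could look roughly like the following. This is a sketch under stated assumptions: the Parts field, the AudioRef return type, and the exact signatures are hypothetical, chosen to match the behavior this description lists (decode base64, keep the format, skip empty or missing audio, report a malformed payload as an error).

```go
package main

import (
	"encoding/base64"
	"fmt"
)

// InputAudio holds the inline base64 payload and its format tag.
type InputAudio struct {
	Data   string
	Format string
}

// ContentPart is one typed element of a message (assumed shape).
type ContentPart struct {
	Type       string
	Text       string
	InputAudio *InputAudio
}

// Message carries an ordered list of content parts (assumed shape).
type Message struct {
	Parts []ContentPart
}

// AudioRef pairs decoded audio bytes with their declared format.
// The name is hypothetical.
type AudioRef struct {
	Data   []byte
	Format string
}

// hasPayload reports whether a part is an audio part with non-empty data.
func hasPayload(p ContentPart) bool {
	return p.Type == "input_audio" && p.InputAudio != nil && p.InputAudio.Data != ""
}

// HasAudio is the cheap check: it scans the parts without decoding anything.
func (m Message) HasAudio() bool {
	for _, p := range m.Parts {
		if hasPayload(p) {
			return true
		}
	}
	return false
}

// AudioRefs decodes every audio clip on the message. Empty or missing audio
// is skipped; a payload that is not valid base64 is returned as an error.
func (m Message) AudioRefs() ([]AudioRef, error) {
	var refs []AudioRef
	for i, p := range m.Parts {
		if !hasPayload(p) {
			continue
		}
		data, err := base64.StdEncoding.DecodeString(p.InputAudio.Data)
		if err != nil {
			return nil, fmt.Errorf("content part %d: invalid input_audio base64: %w", i, err)
		}
		refs = append(refs, AudioRef{Data: data, Format: p.InputAudio.Format})
	}
	return refs, nil
}
```

A caller would typically gate on HasAudio first and call AudioRefs only when a transcription stage actually needs the bytes.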

Text flattening continues to ignore non-text parts, so audio rides along on the
message for a transcription stage to read instead of being dropped or leaking
into the prompt text.
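
To make the flattening behavior concrete, here is a minimal sketch of a TextContent method that keeps only text parts. The struct shapes and the newline separator are assumptions for illustration; the PR states only that non-text parts are ignored.

```go
package main

import "strings"

// ContentPart is one typed element of a message (assumed shape).
type ContentPart struct {
	Type string
	Text string
}

// Message carries an ordered list of content parts (assumed shape).
type Message struct {
	Parts []ContentPart
}

// TextContent flattens a message to prompt text. Parts whose type is not
// "text" (images, audio) are skipped, so they never leak into the prompt.
// Joining with a newline is an assumption of this sketch.
func (m Message) TextContent() string {
	var texts []string
	for _, p := range m.Parts {
		if p.Type == "text" {
			texts = append(texts, p.Text)
		}
	}
	return strings.Join(texts, "\n")
}
```

Because the audio part stays on the message and is only skipped during flattening, a later transcription stage can still read it.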

Scope

This is the API parsing layer, mirroring the earlier image content change. It is
pure Go and fully unit tested. The tests cover base64 decoding with the format
preserved, rejection of a malformed payload, skipping empty or missing audio,
the HasAudio check, and that TextContent still ignores audio parts. Decoding the
audio container and running speech-to-text land later; this change makes the
audio bytes available instead of discarding them.

Test

go test ./... green.
go vet ./... clean.

Audio arrives in a chat message the same way images do: as a typed content part. OpenAI carries it inline as base64 under an input_audio part with a format field. Add the InputAudio type to ContentPart and AudioRefs on Message to decode the clips out, alongside HasAudio for a cheap check. Text flattening still ignores non-text parts, so audio rides along on the message for a transcription stage to read rather than being dropped or leaking into the prompt text. A payload that does not decode is reported as an error.

tamnd merged commit 80b4d7a into main Jun 4, 2026
1 check passed

tamnd deleted the audio-content branch June 4, 2026 10:16

tamnd merged 1 commit into main from audio-content
