Talk

Open Typeless. Local Typeless. Typeless in your box.

macOS menu bar voice input — hold a hotkey, speak, and your words are recognized, polished, and pasted into the active app. No cloud. No typing.

Download latest Talk · 中文文档 · 开发文档

Features

On-device inference — Apple Silicon + MLX, no cloud, no network after model download
Three ASR engines — MLX local (Qwen3-ASR), Apple Speech Recognition, or Gemma 4 multimodal
Text polishing — Qwen3.5-4B or Gemma 4, removes filler words, adds punctuation, smart formatting
One-pass mode — Set both ASR and LLM to Gemma 4 for single-model speech-to-polished-text
Auto hotword learning — Passively observes your edits, learns ASR corrections via LLM extraction
Selection edit mode — Select text, speak a command ("fix the typo", "make it casual")
Per-app prompt profiles — Different polish styles for Terminal, VSCode, WeChat, etc.
Audio history — Every recording saved as AAC/M4A with ASR context for replay and debugging
Usage statistics — Daily session count, recording duration, error rate, 7-day chart, 90-day retention
Real-time preview — Streaming ASR shows partial transcription as you speak
Floating status indicator — Always-on-top overlay with audio level meter
Customizable hotkey — Key recorder, Push-to-Talk / Toggle modes
Output options — Auto-paste, clipboard-only, or preview window

Quick Start

Download latest release (DMG)
Open the DMG, drag Talk to /Applications
Launch — macOS will prompt for permissions:
- Microphone — for recording
- Input Monitoring — for global hotkey (System Settings → Privacy & Security)
- Accessibility — for auto-pasting text (System Settings → Privacy & Security)
Hold your hotkey (default Fn+A), speak, release — text appears in the active app

Models (~3 GB) download automatically from HuggingFace on first use. Pre-download with make download-models if building from source.

Performance

All inference on-device via Apple Silicon GPU. No network required after model download.

Stage	Latency	Notes
ASR (3-5s audio)	0.07 - 0.18s	17-51× faster than real-time
LLM polish (short)	0.35 - 0.50s	~30 chars input
LLM polish (long)	1.1 - 1.2s	~120 chars input
Full pipeline	~1s	ASR + LLM combined (models warm)
ASR model load	2s	Cold start, one-time
LLM model load	0.6s	Cold start, one-time

Memory usage:

State	RSS
ASR model loaded	~1.6 GB
Both models loaded	~5.4 GB

Full benchmark details: docs/BENCHMARK.md

Compatibility

	Supported	Notes
macOS 26.x (Tahoe)	✅	Built & tested
macOS 15.x (Sequoia)	Likely	MLX dependencies support macOS 14+
macOS 14.x (Sonoma)	Maybe	Minimum for MLX
macOS 13 and below	No	MLX requires macOS 14+
Intel Mac	No	MLX is Apple Silicon only

Requirements

Apple Silicon (M1/M2/M3/M4)
macOS 14.0+ (Sonoma minimum; pre-built DMG targets macOS 26.2+)
16 GB RAM recommended
~3 GB disk space for model files

Models

Model	Size	Purpose
Qwen3-ASR-0.6B-4bit	~400 MB	Speech recognition (MLX)
Qwen3.5-4B-MLX-4bit	~2.8 GB	Text polishing (default LLM)
Gemma 4 4B	—	Multimodal: ASR + LLM in one model
Gemma 4 2B	—	Lightweight multimodal option

Models auto-download from HuggingFace to ~/.cache/huggingface/.

Permissions

On first launch, grant these in System Settings → Privacy & Security:

Microphone — Required for recording. macOS prompts automatically.
Input Monitoring — Required for global hotkey.
Accessibility — Required for auto-pasting text into other apps.

If the hotkey doesn't respond, check Input Monitoring first. Quit and relaunch Talk after enabling.

Vocabulary & Auto Learning

Talk learns from your corrections in two ways:

Passive edit observation — After text injection, Talk monitors the text field. If you edit (e.g., fix a misrecognized word), it detects the change and extracts corrections via a background LLM pass. A ⚡ capsule confirms when new corrections are learned.

Manual — Edit polished text in history view, or add entries in Settings → Personal Vocabulary → Manage Vocabulary. Supports JSON import/export.

Top corrections are injected into the LLM system prompt and applied automatically in future sessions.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 164 Commits
.claude		.claude
Talk.xcodeproj		Talk.xcodeproj
Talk		Talk
TalkTests		TalkTests
docs		docs
scripts		scripts
shared		shared
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
DEV.md		DEV.md
ISSUES.md		ISSUES.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README_zh.md		README_zh.md
ROADMAP.md		ROADMAP.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Talk

Features

Quick Start

Performance

Compatibility

Requirements

Models

Permissions

Vocabulary & Auto Learning

License

About

Uh oh!

Releases 16

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Talk

Features

Quick Start

Performance

Compatibility

Requirements

Models

Permissions

Vocabulary & Auto Learning

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 16

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages