How It Works

Three steps. That's it.

⌘

Deploy the latest

build to staging and

run integration tests

Simple: Hold ⌘, speak, text appears in any app

⌘

hold

tap

🎤 You say:

"ugh, login is broken again, users can't get in through OAuth, something with the callback, please just fix it"

✨ Transform

## Problem

OAuth login flow broken —

callback returns 500

## Action Plan

1. Reproduce: login → OAuth → 500

2. Check /auth/callback logs

3. Fix token exchange logic

4. Add e2e test for OAuth flow

5. Verify on staging

## Priority: P0 — users blocked

🤖

Remote Claude Code

remote agent

Fix OAuth callback error in auth server

Routing: Hold ⌘, tap 1 → Transform → Send to AI agent

Why SpeechButton

Everything runs on your Mac. No accounts, no servers, no cloud.

🔒

100% Private & Local

Your voice never leaves your Mac. Speech recognition runs on the Apple Neural Engine via CoreML. AI transforms also run locally. No internet required.

🌍

Multi-Language Support

Multiple speech models to choose from. Fast English recognition or 100+ languages. Auto-detect or set your language explicitly.

⚡

7ms Instant Capture

Press the key — recording starts in 7 milliseconds. Other apps take 200ms+ and lose the first word. SpeechButton captures everything from the very first syllable.

🛠

AI-Ready Text Config

All settings in a plain config.toml file. Hand it to an AI agent — it configures everything for you. Per-hotkey channel bindings, transform pipelines, device rules. Professional-grade control without a GUI.

Trusted by developers and creators

🤖Claude Code

💬Slack

✅Linear

💻VS Code

📝Notion

🖥Terminal

★★★★★

"With other apps I kept losing the first word and had to repeat myself. SpeechButton captures everything from the first syllable. Saved me 2 hours a day on documentation alone."

Marcus Torres
Senior Software Engineer, Stripe

★★★★★

"I walk around the room and code by voice. My back pain is gone. Wrote 40% more code last month just by dictating instead of typing. Development feels completely different now."

James Kim
Full-Stack Developer, Linear

★★★★★

"I manage 5 AI agents just by talking. Hold a key, give a task, release. Cut context-switching time by 70%. The channel routing is a game changer for multi-agent workflows."

Sarah Liu
AI Ops Lead, Vercel

Simple Pricing

Start free. Upgrade when you need more.

Free

forever

✓ 15 min/day transcription
✓ Multiple speech models
✓ 100% local & private
✓ Push-to-talk & VAD
✓ Auto-paste & auto-send

Download Free

Best Value — Save 27%

Pro Yearly

$5.83 /month

$69.99 billed annually

You save $26/year vs monthly

✓ Unlimited transcription
✓ Everything in Free
✓ Hotkey channels & routing
✓ AI transforms (local + cloud)
✓ Priority support

Start Free Trial

Pro Monthly

$7.99 /month

billed monthly, cancel anytime

✓ Unlimited transcription
✓ Everything in Free
✓ Hotkey channels & routing
✓ AI transforms (local + cloud)
✓ Priority support

Start Free Trial

How much time will you save?

Voice dictation is 3× faster than typing. See what that means for you.

Hours you type per day

1h3h8h

saved/day

20h

saved/month

$1,000

value/month*

*Based on $50/hour average developer rate. Voice is ~3x faster than typing.

Both plans: 100% local. Your voice never leaves your Mac. No cloud. No data collection.

Voice Activity Detection

Talk to AI agents.
Hands free.

SpeechButton transcribes in chunks as you speak. Every time you pause briefly, the chunk is transcribed and pasted instantly. When you finish talking, almost all text is already there.

With auto-Enter enabled, a longer silence (3s) sends the full message automatically. Hold the hotkey, talk to Claude Code or ChatGPT, pause briefly between thoughts — chunks appear in realtime. Stop talking for 3 seconds — message is sent. No keyboard needed.

# Hands-free VAD
[vad]
enabled = true
chunk_silence_sec = 0.7

[global]
auto_send = true
send_delay_sec = 3.0

You speak

"Deploy the latest build to staging..."

Brief pause (0.5s)

Chunk 1 transcribed & pasted instantly

You continue speaking

"...and run the integration tests after"

Another pause → chunk 2 pasted

Previous text already there, only last chunk to process

✓

3s silence → auto-Enter

Full message sent. Zero wait time — text was already pasted in chunks.

Text Transform Pipeline

Transform Your Speech Into Ready‑to‑Send Text

Your speech goes through a pipeline: voice → raw text → prompt-driven transform → destination. You control the transform with a prompt.

🎤 Speech

→

📄 Raw Text

→

✨ Prompt Transform

→

🚀 Destination

✨ Clean up for a social media post

🎤 You say

"so like basically our app is um the fastest dictation tool on mac you know"

✨ Prompt

"Remove filler words, fix grammar, make it punchy for Twitter."

🚀 Delivered

Our app is the fastest dictation tool on Mac.

→ pasted in active app

🤖 Send a task to AI agent

🎤 You say

"fix the login bug where users get a blank screen after entering their password"

✨ Prompt

"Format as: /start-task <description>, priority: high"

🚀 Delivered

/start-task Fix login bug: blank screen after password entry, priority: high

→ sent to Claude Code

🎨 Generate Midjourney Prompt

🎤 You say

"draw me a cat sitting on a cloud watching a sunset, anime style"

✨ Prompt

"Convert to a detailed Midjourney prompt with style keywords and parameters."

🚀 Delivered

A cute cat sitting on a fluffy cloud watching a sunset, anime style, soft pastel colors, Studio Ghibli inspired, golden hour lighting, dreamy atmosphere --ar 16:9 --v 6 --style raw

→ pasted in Midjourney

🌍 Speak Spanish, deliver English

🎤 You say

"necesito revisar el servidor antes de la reunión de mañana"

✨ Prompt

"Translate to English. Keep it natural."

🚀 Delivered

I need to check the server before tomorrow's meeting.

→ pasted in Slack

Voice Routing

Your Voice, Any Destination

Text doesn't just go into your active app. Route your speech to AI agents, work tools, social media, and more — each with its own hotkey channel.

📄

Type in Any App

Paste into active window

🤖

Claude Code

Remote Control API

💻

Windsurf / Copilot

AI coding assistants

💬

Slack / Discord

Team chat

📰

Social Media

Twitter, LinkedIn posts

📝

Notion / Obsidian

Notes & knowledge base

✅

Linear / GitHub

Create issues & tasks

📧

Send via API or script

🚀

Webhooks

Zapier, Make, custom APIs

🖥

Terminal

Run shell commands

📦

Telegram Bot

Channels & groups

🔗

Any API

curl, scripts, you name it

Each destination is a [[hotkey]] channel in your config. Press ⌘+1, ⌘+2, ⌘+3 — each routes to a different place.

Built for Power Users

Hands-free agent workflows, voice-driven automation, and programmable text pipelines.

🗣

Hands-Free with VAD + Auto-Enter

Voice Activity Detection sends text as you speak. Combined with auto-Enter, you can talk to AI agents (Claude Code, ChatGPT, Slack bots) without touching the keyboard at all.

🛠

Text Config File

All settings in a single config.toml file. AI agents can configure SpeechButton programmatically — no GUI needed. Changes apply instantly without restart.

✨

Text Transform Pipeline

Process text before it's pasted: run it through a script, send it to an LLM API, or transform it locally. Get cleaned-up, formatted, or translated text — all from your voice.

🔌

Hotkey Channels

Hold Command, then press 1, 2, or 3 to route your speech to different destinations. Send voice to one agent, then switch to another with a different channel — perfect for multi-agent workflows.

🔗

Webhook & File Output

Send transcribed text to a webhook URL for integrations, or log everything to a file for history and audit. All outputs work simultaneously — paste, file, webhook, and exec at once.

📱

iPhone as Wireless Mic

Use your iPhone as an external microphone. With keep_hot = true the mic stays always-on — no 300ms wake-up delay when you start talking. Same blazing fast response, even over wireless.

{}

Text & JSON Output

Choose output_format = "text" for plain text or "json" for structured data with timestamps, language, and confidence — ideal for programmatic pipelines.

~/.config/speechbutton/config.toml

[global]
model = "parakeet-tdt-0.6b-v3-int8"

# Default: paste transcription at cursor
[[hotkey]]
key = "RightCommand"
name = "default"
paste = "accessibility"

# ⌘+1: clean up speech with Local AI (free, offline)
[[hotkey]]
key = "RightCommand"
channel = "1"
name = "cleanup"
transform = "prompts/cleanup.md"

# ⌘+2: translate to English with Local AI
[[hotkey]]
key = "RightCommand"
channel = "2"
name = "translate"
transform = "prompts/translate_en.md"

# ⌘+3: send to Slack
[[hotkey]]
key = "RightCommand"
channel = "3"
name = "slack"
transform = "prompts/slack_message.md"
exec = "SLACK_WEBHOOK_URL=https://hooks.slack.com/xxx integrations/send_slack.py"

# ⌘+4: create Linear issue
[[hotkey]]
key = "RightCommand"
channel = "4"
name = "linear"
transform = "prompts/linear_issue.md"
exec = "LINEAR_API_KEY=lin_api_xxx integrations/send_linear.py"

# Auto-detect iPhone mic, keep stream hot
[[device_rule]]
match = "iPhone"
keep_hot = true

Frequently Asked Questions

Yes. Your iPhone works as a wireless microphone with zero extra setup via Apple Continuity. SpeechButton automatically detects it and you can even set keep_hot = true to keep the mic always-on for instant response.

Yes! SpeechButton has a full GUI where you can configure everything — hotkeys, channels, transforms, VAD settings. But we also offer a plain-text config.toml file, which is rare for dictation apps. This means an AI agent can configure SpeechButton for you. For powerful routing setups, we recommend using small Python scripts — just tell your agent where you want to send messages, and it will write ~40 lines of code that gives you a seriously powerful voice pipeline.

Blazing Fast
Speech to Text Router

How It Works

Why SpeechButton

100% Private & Local

Multi-Language Support

7ms Instant Capture

AI-Ready Text Config

Fully Local AI Pipeline

Simple Pricing

How much time will you save?

Talk to AI agents.
Hands free.

Transform Your Speech Into Ready‑to‑Send Text

✨ Clean up for a social media post

🤖 Send a task to AI agent

🎨 Generate Midjourney Prompt

🌍 Speak Spanish, deliver English

Your Voice, Any Destination

Built for Power Users

Hands-Free with VAD + Auto-Enter

Text Config File

Text Transform Pipeline

Hotkey Channels

Webhook & File Output

iPhone as Wireless Mic

Text & JSON Output

Frequently Asked Questions

Start Talking, Stop Typing

├─	⌘	→	📝 Paste in active app
├─	⌘+1	→	🤖 Claude Code CEO Agent
├─	⌘+2	→	💬 Slack
└─	⌘+3	→	🔄 Transform → ✅ Linear

Blazing Fast Speech to Text Router

How It Works

Why SpeechButton

100% Private & Local

Multi-Language Support

7ms Instant Capture

AI-Ready Text Config

Fully Local AI Pipeline

Simple Pricing

How much time will you save?

Talk to AI agents. Hands free.

Transform Your Speech Into Ready‑to‑Send Text

✨ Clean up for a social media post

🤖 Send a task to AI agent

🎨 Generate Midjourney Prompt

🌍 Speak Spanish, deliver English

Your Voice, Any Destination

Built for Power Users

Hands-Free with VAD + Auto-Enter

Text Config File

Text Transform Pipeline

Hotkey Channels

Webhook & File Output

iPhone as Wireless Mic

Text & JSON Output

Frequently Asked Questions

Start Talking, Stop Typing

Coming to your platform soon!

You're on the list!

Blazing Fast
Speech to Text Router

Talk to AI agents.
Hands free.