Apple Dictation vs EnviousWispr: Custom Words on Mac

Q: Can EnviousWispr remove filler words from dictation?

Yes. When AI polish is enabled, filler words like um, uh, you know, and like are stripped automatically. Apple Dictation transcribes them as-is.

Q: Does EnviousWispr support voice commands like Apple Dictation?

No. Apple Dictation supports voice commands such as new paragraph, comma, and emoji names. EnviousWispr does not process voice commands during recording. Instead, AI polish handles punctuation, paragraph breaks, and formatting automatically based on context.

Side by Side

Feature comparison

Apple Dictation is a solid default. Here is where a dedicated dictation app pulls ahead.

	EnviousWispr	Apple Dictation
Price	Free. No subscription.	Free, built into macOS
Setup required	One-time download + model install (~2 min)	None. Already installed.
Account required	No	Apple ID (for macOS itself)
Audio processing	On-device (Parakeet TDT + WhisperKit)	On-device (Apple Silicon); cloud fallback on Intel
Timeout	No timeout. Handles 1-2 minute dictation bursts.	Stops after ~30s of silence
AI polish	EG-1, Apple Intelligence, Ollama (on-device), or BYO OpenAI/Gemini key	None
Filler word removal	Yes, automatic via AI polish	No (transcribes "um", "uh" faithfully)
Writing style control	Yes (voice-preserving AI polish that adapts to what you said)	No
Custom vocabulary	Custom words with 6-pass fuzzy matching + AI alias suggestions	Not available
Transcription speed	0.43s median; ~1.5s with AI polish	Real-time streaming (words appear as you speak)
Accuracy (clean audio)	High (Parakeet TDT 0.6B + AI polish correction)	90-92% (degrades significantly with background noise)
First-word capture	Yes (always-listening VAD, no warm-up gap)	Can miss the first word while the mic activates
Punctuation	Auto-punctuation + AI polish reformats	Auto-punctuation, emoji dictation, voice commands
Voice commands	Not supported	"New paragraph", "comma", "smiley face"
Clipboard preservation	Yes (saves and restores clipboard around paste)	No (inserts directly at cursor)
Auto-stop on silence	Yes (configurable VAD sensitivity)	Stops listening entirely after ~30s silence
Transcript history	Yes (searchable transcript log)	No history
Language support	English (Parakeet), 90+ via WhisperKit	Dozens of locales
Accessibility	Keyboard-driven hotkey, menu bar app, VoiceOver compatible	Deep OS integration, mixed keyboard + dictation input
Works offline	Yes (after one-time model download)	Yes on Apple Silicon; No on Intel
Platform	macOS (Apple Silicon required)	macOS, iOS, iPadOS, visionOS
Source code	Open source on GitHub (GPLv3)	Closed source

EnviousWispr latency from production PostHog data (environment=production) on Apple Silicon Macs. Apple Dictation accuracy from published third-party benchmarks. Apple Dictation features verified against Apple Support documentation and firsthand testing on macOS 15. Last verified: April 2026.

Download Free

Why Upgrade

What a dedicated dictation app gives you

Apple Dictation works. EnviousWispr works harder. Here is what changes when you move beyond the built-in. For more on why on-device matters, read on-device vs cloud dictation privacy.

🧠

AI polish cleans your words

Apple Dictation gives you raw transcription. EnviousWispr pipes text through AI polish to remove filler words, fix grammar, and format paragraphs. Choose EG-1, Apple Intelligence, Ollama, or your own OpenAI/Gemini key.

📖

Custom words that stick

Names, product terms, technical jargon. Apple Dictation has no custom vocabulary feature. EnviousWispr lets you add custom words so "Parakeet TDT" and "Kubernetes" come out right every time.

⏱️

No 30-second timeout

Apple Dictation stops listening after roughly 30 seconds of inactivity. EnviousWispr has no timeout. Dictate a full paragraph, pause to think, keep going. Designed for 1-2 minute bursts without interruption.

⚡

0.43s to finished text

Median time from end of speech to text in clipboard: 0.43 seconds without polish, 1.5 seconds with AI polish. Processed locally on the Neural Engine. No network dependency.

🔇

Filler words removed

Uh, um, you know, like. Apple Dictation transcribes them faithfully. EnviousWispr's AI polish strips them out so your text reads like writing, not a transcript.

🔍

Auditable source code

Every line is on GitHub under GPLv3. See exactly how your audio is processed. Apple Dictation is a black box built into the OS.

Speed

Dedicated models, dedicated speed

EnviousWispr uses purpose-built ASR models optimized for Apple Silicon. The result is fast, accurate transcription with optional AI refinement.

0.43s

Median transcription

From end of speech to raw text

1.5s

With AI polish

EG-1 or Apple, on-device

Timeout limit

No inactivity timeout. Up to a full hour per recording.

Based on production data from Apple Silicon Macs. Results vary by hardware and settings.

Under the Hood

Where a dedicated app goes deeper

Apple Dictation gives you raw speech-to-text. EnviousWispr adds layers that turn dictation into polished writing.

📖

Custom vocabulary and smart correction

Apple Dictation has no custom vocabulary feature. If you work with specialized terminology, product names, or technical jargon, you get whatever the built-in model guesses. "Parakeet TDT" might come out as "parakeet teddy." "Kubernetes" might become "Cooper Netties."

EnviousWispr lets you add custom words with a 6-pass fuzzy matching system. Each word can have aliases (phonetic variants the ASR model might produce), and the app suggests aliases automatically using AI analysis of likely misrecognitions. When the model outputs "envious whisper," the custom words system corrects it to "EnviousWispr" before the text reaches your clipboard.

This works at the post-processing layer, so it applies regardless of which ASR backend you use. Add a word once, and it corrects everywhere.

How it works: Custom words are matched using Levenshtein distance, phonetic similarity, case-insensitive comparison, substring containment, and word-boundary-aware replacement. AI alias suggestions use the LLM to predict how speech recognition models typically mangle specific terms.

🧠

AI polish transforms raw speech

Apple Dictation gives you exactly what you said, including every "um," "uh," "you know," and false start. For quick one-liners, that is fine. For anything longer, you end up editing the transcript to make it readable.

EnviousWispr pipes transcription through an AI polish step that removes filler words, fixes grammar, formats paragraphs, and adjusts tone. You can choose EG-1 (our own on-device model), Apple Intelligence (fully on-device), Ollama (local open-source models), or bring your own OpenAI or Gemini API key.

The polish layer includes hallucination safeguards: short transcripts bypass the LLM entirely, output length is validated against input length, and preamble artifacts ("Certainly! Here is...") are stripped automatically. Your dictated text is wrapped in XML tags with explicit instructions to polish, not answer or rewrite.

How it works: Context-aware prompts include the target app name, detected language, and ASR error patterns. Sandwich framing wraps the transcript in <transcript> tags to prevent prompt injection. Output exceeding 3x input length is rejected as probable hallucination, falling back to raw text.

Being Honest

Choose Apple Dictation if you only need occasional built-in dictation

Apple Dictation is built into the OS for a reason. It does some things that a third-party app cannot match:

🚀

Zero setup required

Apple Dictation is already on your Mac. Turn it on in System Settings, press the microphone key, and start talking. No download, no model installation, no permissions to grant. It just works.

📱

Works on every Apple device

iPhone, iPad, Mac, Vision Pro. Apple Dictation follows you across the ecosystem with consistent behavior. EnviousWispr is macOS only and requires Apple Silicon.

🗣️

Voice commands for formatting

Say "new paragraph," "comma," "exclamation mark," or even "smiley face" and Apple Dictation inserts the right character. EnviousWispr does not support voice commands for punctuation or formatting.

🌍

No separate app needed

Apple Dictation is part of the operating system. No menu bar icon, no extra process, no Accessibility permissions to approve. For light dictation use, the built-in option carries zero overhead.

⌨️

Mixed keyboard and voice input

On macOS 14+, you can type on the keyboard while Dictation is active. Switch freely between voice and keys mid-sentence. EnviousWispr uses a record-then-paste model, so you speak first, then the text appears.

If you dictate more than a few sentences at a time and want polished output with custom vocabulary, give EnviousWispr a try. It runs alongside Apple Dictation with no conflicts.

FAQ

Common questions

Is there a better alternative to Apple Dictation on Mac?

EnviousWispr is a free, on-device alternative that adds AI polish, custom vocabulary, and no timeout limit. It runs alongside Apple Dictation without interfering. Both use Apple Silicon for local processing.

Can I add custom words to Mac dictation?

Apple Dictation does not support custom vocabulary. EnviousWispr lets you add custom words for names, brands, and technical terms. The app applies them during transcription so specialized words come out correctly.

Why does Apple Dictation stop listening after 30 seconds?

Apple Dictation has a built-in inactivity timeout. If you pause speaking for too long, it stops. EnviousWispr has no timeout, so you can pause to think and continue dictating without restarting.

Does EnviousWispr replace Apple Dictation?

It runs alongside Apple Dictation, not instead of it. You can keep Apple Dictation enabled for quick one-liners and use EnviousWispr for longer dictation where you want AI polish and custom words. They use different hotkeys and do not conflict.

Is EnviousWispr really free?

Yes. No subscription, no usage limits, no account required. Download and use it. The source code is available on GitHub under a GPLv3 license.

Does EnviousWispr work offline?

Transcription runs entirely on-device after a one-time model download. AI polish can also run locally via EG-1, Apple Intelligence, or Ollama. No internet required for the full pipeline.

How accurate is EnviousWispr compared to Apple Dictation?

EnviousWispr uses Parakeet TDT, a modern ASR model, combined with AI polish that fixes misrecognitions and grammar. Apple Dictation achieves ~90% in quiet conditions (third-party tests) in clean environments but degrades significantly with background noise. EnviousWispr's AI polish layer provides an additional correction pass that Apple Dictation lacks.

Can EnviousWispr remove filler words from dictation?

Yes. When AI polish is enabled, filler words like "um," "uh," "you know," and "like" are stripped automatically. Apple Dictation transcribes them as-is. If you dictate longer passages, filler removal makes a noticeable difference in readability.

Does EnviousWispr support voice commands like Apple Dictation?

No. Apple Dictation supports voice commands such as "new paragraph," "comma," and emoji names. EnviousWispr does not process voice commands during recording. Instead, AI polish handles punctuation, paragraph breaks, and formatting automatically based on context.

What Mac do I need?

Any Mac with Apple Silicon (M1 or later) running macOS 14 Sonoma or newer. The Neural Engine on Apple Silicon is what makes on-device transcription fast. See the 2-minute getting started guide.

Can I use both Apple Dictation and EnviousWispr at the same time?

Yes. They use separate hotkeys and separate audio pipelines. Keep Apple Dictation for quick inline corrections and voice commands. Use EnviousWispr for longer dictation where you want AI polish, custom vocabulary, and transcript history. They do not interfere with each other.

Compare with other tools

vs WisprFlow vs Superwhisper vs Dragon vs Otter.ai vs MacWhisper vs VoiceInk vs Willow Voice vs Google Docs vs Notta vs whisper.cpp

Try free dictation that works in any Mac app

Free to download. No account required. Runs alongside Apple Dictation.

Download Free See all comparisons Back to Home