All comparisons
Ottex AI icon
Aqua Voice icon
Spokenly icon
2026 Comparison

Aqua Voice vs Spokenly vs Ottex AI: Dictation App Comparison (2026)

An honest, side-by-side comparison of features, pricing, privacy, and technology. Find out which dictation tool fits your workflow.

Quick Overview

Ottex AI icon

Ottex AI

Pricing

Free (BYOK, Local models), or pay-as-you-go (Ottex Provider)

Privacy

Excellent - Choose between Local models and BYOK. You control where data goes.

Key Strength

Professional AI dictation with free local/BYOK options, pay-as-you-go hosted models, multilingual model choice, and per-app formatting instructions.

Download
Aqua Voice icon

Aqua Voice

Pricing

$8/mo, $96/yr

Privacy

Cloud-based processing with Privacy Mode option. When Privacy Mode disabled, transcript data may be stored for product improvement. When enabled, data not stored. TLS-encrypted connections. Team plans support org-wide Privacy Mode.

Key Strength

Industry-leading speed (450ms), 99.1% accuracy claim, natural language editing, context-aware formatting, 49 language support, custom instructions

Visit Website
Spokenly icon

Spokenly

Pricing

$8/mo, $80/yr

Privacy

Excellent - Local models: 100% private. Online models: backend processing. API keys: direct provider connection, no audio to app server.

Key Strength

Allows opening apps and apple shortcuts with your voice.

Visit Website

Pricing

FeatureOttex AIAqua VoiceSpokenly
Lifetime Price
Free (Local models, BYOK)
N/A
Free: Local Models or BYOK
Subscription
Free (BYOK, Local models), or pay-as-you-go (Ottex Provider)
$8/mo, $96/yr
$8/mo, $80/yr
Education Discount
Same pricing for all
70% off annual plans with .edu email
N/A

Pricing takeaway

The pricing decision is mostly about whether you want another fixed subscription or a usage-based setup that can stay close to zero for light use.

Ottex is strongest when you want to avoid subscription lock-in: local models and BYOK can be free, while Ottex AI Provider is pay-as-you-go for hosted models. Use Aqua Voice and Spokenly as the benchmark only if their paid model fits your expected volume and workflow.

Spokenly has a clear subscription price and also lists free local/BYOK usage in the table.

Platform

FeatureOttex AIAqua VoiceSpokenly
Platforms
MacOS, iOS
MacOS, Windows
MacOS, iOS
Open Source

Platform takeaway

Platform coverage only becomes a major decision point when Windows support is required.

Aqua Voice is the better fit if Windows support is mandatory. Ottex is the more focused Apple-device option for Mac/iOS workflows.

Technology

FeatureOttex AIAqua VoiceSpokenly
Transcription Engines
Cloud: Whisper, Gemini 3 Flash, Voxtral. Local: Whisper, Parakeet v3
Avalon (proprietary), uses 6 models including OpenAI infrastructure
Whisper, Apple SpeechAnalyzer, Deepgram, Scribe (ElevenLabs), Whisper v3/Turbo (Fireworks), GPT-4o-transcribe (OpenAI)
Processing Options
Cloud (BYOK), Local
Cloud
100% Local, Cloud (Included), Cloud (add API key)
Model Selection
Apple Silicon Optimized
Memory Management
Excellent (~50MB)

Technology takeaway

The technology difference is whether the app lets you choose the engine for the task, or hides that choice behind one managed default.

Ottex is the stronger choice when you want to choose models per task: local for privacy, BYOK for direct provider control, and Ottex AI Provider as a convenience aggregator for top hosted models from one account.

Aqua Voice is more managed than Ottex, so it gives less direct control over providers and model choice.

Dictation

FeatureOttex AIAqua VoiceSpokenly
Dictation Type
Realtime, Post-Dictation
Realtime, Post-Dictation
Realtime, Post-Dictation, File based
Speed
<500msLocal <500ms Cloud 500ms–2s with Gemini 3 Flash
<500msInstant Mode: 200ms startup, 450ms paste. Streaming Mode: 850ms response.
<500ms
System-Wide Insertion
Works in any text field via floating text box interface, then paste
Trigger Options
Hotkey, Shortcut
Hotkey
Hotkey, Shortcut
Language Auto Detection
100+ languages with Gemini 3 Flash. Mid-sentence language switching. Best multilingual quality available.
Supports 49 languages
Context Awareness
Screenshot-based context analysis. Work modes per app (email, notes, code). Custom instructions per context.
Uses system accessibility APIs to identify relevant words in active applications. Formats output based on app context (legal brief, medical note, casual email, etc.)
Dictation History
Stores local history of transcripts and audio
Search

Dictation takeaway

Ottex AI, Aqua Voice, and Spokenly support realtime dictation. Ottex AI, Aqua Voice, and Spokenly support post-dictation cleanup. Spokenly also handles file-based input.

Ottex AI, Aqua Voice, and Spokenly report sub-second dictation speed. Ottex AI, Aqua Voice, and Spokenly have language auto-detection. Ottex AI, Aqua Voice, and Spokenly have context awareness. Ottex AI includes searchable dictation history.

Ottex stands out when dictation needs to adapt to the app you are in: fast defaults for quick input, and smarter model/instruction choices when the output needs formatting or context-aware cleanup.

Text Processing

FeatureOttex AIAqua VoiceSpokenly
Voice Corrections

'Scratch that', 'actually'

Precision Typing

'New line', 'new paragraph'

new line, new paragraph
Glossaries
Up to 800 custom words/phrases. No pronunciation tuning required.
Filler Word Removal
Auto-correction
Claims 99.1% accuracy, fewer errors than Siri or Google Voice. Automatically handles punctuation and paragraph breaks.
Translation
via AI commands
Multilingual transcription (49 languages) but no translation between languages
Realtime & Files

Text Processing takeaway

Text processing is where transcription becomes usable writing. Ottex AI has the broadest visible coverage in this category.

Ottex is strongest when the result must follow rules: corrections, glossaries, precision typing, formatting, translation, or structured output.

Spokenly can clean up dictation, but the table does not show the same depth of contextual output control as Ottex.

AI & Automation

FeatureOttex AIAqua VoiceSpokenly
AI Integration
BYOK - OpenRouter, Google AI, OpenAI, Anthropic, Mistral, OpenAI competible providers
Built-in (proprietary)
Ollama, User-Provided API Access, App-Provided AI Access
Custom Prompts
Custom Instructions feature - specify formatting preferences for numbers, currency, percentages, dashes, contractions, sentence structure
Inline AI Commands

Unique Ottex feature

Natural language editing like 'Actually, change for example to for instance'. Can say 'Switch to Streaming Mode'.
AI Shortcuts
Meeting Summaries

AI & Automation takeaway

AI & Automation matters when dictation should produce finished work instead of a plain transcript. Ottex AI has the broadest visible coverage in this category.

Ottex is strongest when different apps need different instructions, models, and post-processing rules.

Aqua Voice is easier to start with, while Ottex is stronger when the output must be customized by task. Spokenly is useful for quick dictation, but it is less focused on reusable app-specific formatting rules.

Features

FeatureOttex AIAqua VoiceSpokenly
Menu Bar Access
Batch Transcription
Audio File Transcription
Video Transcription
With diarization (speaker separation)
Meeting Recording
With diarization (speaker separation)

Features takeaway

Feature coverage shows what each tool can do beyond live dictation. Ottex AI and Spokenly have the broadest visible coverage in this category.

Ottex is strongest when feature coverage connects to professional output: live dictation, AI commands, file transcription, and structured writing workflows.

Accessibility

FeatureOttex AIAqua VoiceSpokenly
Apple Shortcuts
Launcher Support

Raycast, Alfred

Raycast, Alfred
No native Raycast/Alfred integrations found. Can be triggered via custom workflows.

Accessibility takeaway

Accessibility is about reducing keyboard-heavy work across real apps. Ottex AI has the broadest visible coverage in this category.

Ottex is strongest when shortcuts, launcher support, corrections, and reusable formatting reduce the need to return to the keyboard after dictating.

Privacy

FeatureOttex AIAqua VoiceSpokenly
Privacy Policy
Excellent - Choose between Local models and BYOK. You control where data goes.
Cloud-based processing with Privacy Mode option. When Privacy Mode disabled, transcript data may be stored for product improvement. When enabled, data not stored. TLS-encrypted connections. Team plans support org-wide Privacy Mode.
Excellent - Local models: 100% private. Online models: backend processing. API keys: direct provider connection, no audio to app server.

Privacy takeaway

Privacy has different meanings across these tools: local processing, direct provider control, and managed cloud compliance solve different problems.

Ottex is strongest when you want to choose the privacy mode per task: local models for private processing, BYOK for direct provider control, and Ottex AI Provider when convenience matters more than account setup.

Frequently asked questions

Why should I consider Ottex when comparing Aqua Voice and Spokenly?+

Ottex is the third option when Aqua Voice and Spokenly do not give you enough control over price, languages, or the final text. Ottex uses pay-as-you-go for hosted AI models, and typical users spend about $1-3/month even with the most advanced models.

It can do the same baseline work as competing dictation tools, including formatting, cleanup, and post-processing, but it also lets you customize output by context: Gmail can use one model and instruction for company-style emails, Linear another for structured tickets or reviews, and a CRM another for customer notes or outreach.

Which app is faster: Aqua Voice, Spokenly, or Ottex?+

Aqua Voice reports <500ms dictation speed (Instant Mode: 200ms startup, 450ms paste. Streaming Mode: 850ms response.). Spokenly reports <500ms dictation speed. Ottex is a native macOS app, so the app itself feels fast and lightweight; transcription latency depends on the model you choose for the task.

In practice, Ottex gives you the most control over the speed/quality tradeoff: choose an ultra-fast model for quick dictation or a smarter model for high-quality formatting and post-processing.

Which app has the best pricing: Aqua Voice, Spokenly, or Ottex?+

Aqua Voice lists $8/mo, $96/yr subscription pricing and N/A lifetime pricing. Spokenly lists $8/mo, $80/yr subscription pricing and Free: Local Models or BYOK lifetime pricing. Ottex is free when you bring your own AI key or use local models, has no required subscription, and uses pay-as-you-go pricing if you use Ottex AI Provider.

With Ottex AI Provider, you can top up once, for example with $5, and use leading models for months without managing separate provider accounts; Ottex adds a 25% markup on API requests, and typical users spend about $1-3/month even when using the most advanced models.

Which app gives me the most control over AI models?+

Aqua Voice is more managed than Ottex, so it gives you less direct control over providers and model choice. Spokenly supports model selection with Whisper, Apple SpeechAnalyzer, Deepgram, Scribe (ElevenLabs), Whisper v3/Turbo (Fireworks), GPT-4o-transcribe (OpenAI). Ottex gives you local models, BYOK providers, and Ottex AI Provider: a convenience aggregator that makes top AI models available from one Ottex account, without registering for separate provider accounts, attaching cards in different places, or manually testing every model setup yourself.

This means you can log in, add credit, and try the models that fit your work instead of opening accounts with every AI provider.

It matters when Gmail, Linear, a CRM, and long-form writing each need a different balance of speed, accuracy, structure, and formatting.

Which app is better for multilingual dictation: Aqua Voice, Spokenly, or Ottex?+

Aqua Voice supports language auto-detection (Supports 49 languages). Spokenly supports language auto-detection. Ottex is strong for multilingual dictation because you can change models/providers when a specific language, accent, or mixed-language workflow needs better recognition.

A single default model can be excellent for one language and weaker for another; Ottex lets you switch models when you need better results for Western European languages, Arabic, mixed-language speech, or less common regional accents.

Which app is better for professional voice workflows: Aqua Voice, Spokenly, or Ottex?+

Aqua Voice includes custom prompts, inline AI commands, context awareness. Spokenly is simpler than Ottex when you need reusable instructions, structured output, and professional post-processing workflows. Ottex is built for professional dictation workflows: start with good defaults, then customize the model, instructions, formatting, and post-processing for each work context when basic transcription is not enough.

Choose Ottex when you want dictation to produce usable work output: structured emails, LinkedIn outreach, Markdown, tickets, support replies, reviews, CRM notes, and documents that follow your rules or company style.

Why Ottex AI is the smart choice

While Aqua Voice and Spokenly have their strengths, Ottex AI gives you the best of both worlds: enterprise-grade accuracy via Gemini 3 Flash, complete privacy with Local models and BYOK, and zero subscription lock-in.

Best Value

Free forever with local models or your own API keys. Optional $9/mo premium is cheaper than competitors.

Multilingual King

Gemini 3 Flash supports 100+ languages with 99.1% accuracy and mid-sentence switching.

Inline AI

Don't just dictate. Command. "Fix grammar", "Make professional", "Translate" - all while you speak.

Try for free

Free for personal use • No credit card required