Thoth

Transcribe · Summarize · Zero Upload

Your Private AI Scribe

Record both sides of any meeting. Transcribe and summarize locally on your Mac. No cloud, no data ever leaving your machine.

Requires macOS Tahoe · Apple Silicon recommended · Free to try

Built for Mac

Native SwiftUI

Real macOS app. Collapsible sidebars, smooth animations, proper Dark Mode. Runs on AppKit, not Electron.

Export & Share

PDF, Markdown, RTF, or JSON with timestamps and color-coded speakers. Or share any transcript or summary directly to Mail, Messages, and AirDrop in one click.

99 Languages

Auto-detects the language and transcribes it. English, French, Japanese, Arabic: Whisper handles them all, entirely on your Mac.

Audio Architecture

Two channels.
Zero ambiguity.

Most tools blend your mic and remote audio into a single stream, then guess who spoke when. Thoth keeps them separate from the very first sample. No guesswork needed.

Channel 1 · Mic

Your Voice

Dedicated stream from your microphone. Captured independently and identified as you before transcription starts. Never mixed with remote audio.

Always Speaker 1

Channel 2 · System

Remote Voices

Captured from Zoom, Teams, Meet or any app via the macOS audio engine. A completely independent stream. No screen capture, no bot joining the call.

Speaker 2 and beyond

No guesswork. Every word attributed correctly.

When channels are separate, telling you apart from remote participants is deterministic, not estimated. Mic is always you. System audio is always them. Even when everyone talks at once.
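To make the idea concrete, here is a minimal sketch of channel-based attribution. It is an illustration, not Thoth's actual code: the `Segment` type, the `merge_channels` function, and the sample sentences are all hypothetical. It only shows why a label taken from the capture channel needs no estimation, even when speech overlaps.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed chunk from a single channel (hypothetical type)."""
    start: float  # seconds from the start of the recording
    end: float
    text: str


def merge_channels(mic, system):
    """Interleave two per-channel transcripts into one timeline.

    The speaker label comes from the channel a segment was captured on,
    so it is known before any transcript text is compared.
    """
    labeled = [("You", s) for s in mic] + [("Remote", s) for s in system]
    # Sort by start time; overlapping segments keep their own labels.
    labeled.sort(key=lambda pair: (pair[1].start, pair[0]))
    return [(who, seg.start, seg.text) for who, seg in labeled]


# Made-up sample data: the second mic segment overlaps remote speech.
mic = [Segment(0.0, 2.1, "Shall we start?"), Segment(4.0, 6.5, "I disagree.")]
system = [Segment(2.3, 4.8, "Yes, here is the plan."), Segment(6.0, 7.0, "Fair point.")]

timeline = merge_channels(mic, system)
for who, start, text in timeline:
    print(f"[{start:5.1f}s] {who}: {text}")
```

Splitting the remote channel into Speaker 2, Speaker 3, and so on is a separate step and is not shown here.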

Absolute Privacy

Under the Scribe's Seal.

Your data stays yours. Built from day one for people who handle confidential information.

100% Offline Engine

Transcription runs on your Mac through WhisperKit and CoreML. No audio ever leaves your machine.

Local Speaker Detection

An on-device engine detects who is speaking and color-codes your transcript. No cloud processing involved.

On-Device AI Summaries

Choose from five local models (1.9 GB to 6.8 GB) and run AI summaries entirely on your Mac. No API key, no cloud.

Bring Your Own Key

Prefer OpenAI, Anthropic, or Google? Use your own API keys. Requests go directly from your Mac to the provider.*

Keychain Secured

API credentials stay locked in your Apple Keychain. Thoth never stores keys in plaintext or sends them anywhere.

Under the Cartouche

Real numbers, real hardware.

Benchmarked on a 42-minute recording, Apple M2 MacBook Pro.

Re-transcription · 42-min recording, M2 MacBook Pro

Model                     Speed    WER · accented EN   WER · clean EN   Languages
Whisper Large V3 Turbo    12.7×    32.3%               7.8%             99
Parakeet TDT V3 Pro       180×     38.9%               8.7%             25
Whisper Small             17.7×    45.9%               9.2%             99
Whisper Base              59.6×    51.1%               9.2%             99

Live transcription · Script A, French-accented English

Engine                         WER      Latency
Parakeet EOU 120M Pro          38.4%    ~160 ms
Parakeet Sliding Window Pro    56.8%    ~11 s
WhisperKit Base+Small          65.9%    ~12 s

7.72 s to diarize 42 min of audio with 2 speakers
Up to 8 speakers · Fully on-device · Pyannote CoreML

Large V3 Turbo wins on accuracy. Best on all three scripts: 32.3% on French-accented English, 7.8% on clean audio. If the transcript needs to be right, this is the one.
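For readers new to the metric: WER (word error rate) is conventionally the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the number of reference words. The sketch below implements that standard definition. The lowercasing and whitespace splitting are assumptions made for illustration, not necessarily the normalization behind the numbers on this page, and the example sentences are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance / reference word count."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("the" -> "a") and one deletion ("by") over 5 words.
wer = word_error_rate("send the report by friday", "send a report friday")
print(f"{wer:.1%}")
```

Because insertions count as errors, WER can exceed 100% on very noisy output.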

Parakeet TDT V3 is about 14× faster than Large V3 Turbo on the same file (180× vs 12.7×). Near-identical WER on clean speech (8.7% vs 7.8%). Falls behind on accented speech and code-switching. Worth it when speed matters and audio is clean.
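What do those speed factors mean in wall-clock time? The sketch below works it out, assuming "Speed" is a real-time factor (audio duration divided by processing time). That reading is an assumption; the page does not define the column. The factors are the ones listed in the re-transcription table.

```python
RECORDING_MINUTES = 42

# Speed factors as listed in the re-transcription table.
speed = {
    "Whisper Large V3 Turbo": 12.7,
    "Parakeet TDT V3": 180.0,
    "Whisper Small": 17.7,
    "Whisper Base": 59.6,
}


def processing_seconds(audio_minutes: float, factor: float) -> float:
    """Wall-clock seconds, assuming factor = audio time / processing time."""
    return audio_minutes * 60 / factor


times = {name: processing_seconds(RECORDING_MINUTES, f) for name, f in speed.items()}
ratio = speed["Parakeet TDT V3"] / speed["Whisper Large V3 Turbo"]

for name, seconds in times.items():
    print(f"{name:24s} ~{seconds:5.0f} s")
print(f"Parakeet vs Large V3 Turbo: {ratio:.1f}x")
```

Under that assumption, the 42-minute file takes roughly 14 seconds with Parakeet TDT V3 and a little over three minutes with Large V3 Turbo.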

Parakeet EOU is a different category. Word-by-word output at ~160 ms latency. Comparing its 38.4% WER to batch models isn't fair: it's a streaming engine optimised for real-time, not accuracy.

Published benchmarks are optimistic. Every model ran 10-30x worse on accented or foreign-language speech than studio numbers suggest. Real meetings are harder than LibriSpeech.

Local vs Cloud AI Summaries

Tested on a real French-language interview transcript. Scored by Claude Opus across 6 criteria.

Local · Qwen 7B

~5/10

Fast and private. Good for quick overviews. Struggles with nuanced decision capture and quote selection.

Cloud · Claude Sonnet (BYOK)

~8.7/10

Captures operational detail, adapts to content type, better quote selection.

                     Local (Qwen 7B)     Cloud (Claude Sonnet)
Factual accuracy     7/10                9.5/10
Completeness         5/10                9/10
Decision capture     2/10                8.5/10
Action items         5/10                8/10
Quote selection      4/10                8.5/10
Language quality     7/10                9/10
Overall              ~5/10               ~8.7/10
Privacy              Zero data leaves    Text sent to provider
Cost                 Free                ~$0.01/hour
Internet required    No                  Yes
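The overall scores are consistent with a plain average of the six criteria. The sketch below checks that, assuming equal weights. The page does not state how the overall figure was derived, so treat the unweighted mean as an assumption that happens to match.

```python
# Per-criterion scores from the comparison: (local Qwen 7B, cloud Claude Sonnet).
scores = {
    "Factual accuracy": (7, 9.5),
    "Completeness": (5, 9),
    "Decision capture": (2, 8.5),
    "Action items": (5, 8),
    "Quote selection": (4, 8.5),
    "Language quality": (7, 9),
}

# Assumption: every criterion carries the same weight.
local_overall = sum(local for local, _ in scores.values()) / len(scores)
cloud_overall = sum(cloud for _, cloud in scores.values()) / len(scores)

print(f"Local overall: {local_overall:.2f}/10")
print(f"Cloud overall: {cloud_overall:.2f}/10")
```

The means come out to 5.00 and 8.75, in line with the ~5/10 and ~8.7/10 shown.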

Honest takeaway: local models are best when privacy is non-negotiable or internet is unavailable. Cloud AI is better when depth matters.

The privacy guarantee stays constant regardless of which you choose. Audio never leaves your machine. If you use BYOK cloud AI, only the transcript text goes to your chosen provider, directly with your key. Thoth never sees it.

How We Compare

Local-first. Always.

Cloud recorders are convenient. They're also always listening.

Thoth Otter Fireflies Granola
Audio stays on your Mac
No bot joins your call
Works fully offline
Dual-channel recording
On-device AI summaries
Native Mac app

Competitor features are approximate and subject to change. Otter, Fireflies, and Granola are trademarks of their respective owners.

Pricing

Try free, then go Pro.

Start free with full transcription features. Upgrade for unlimited duration and AI.

Free

$0

  • Unlimited recordings
  • 30 min (mic) / 15 min (system audio)
  • 10 AI enhancements/month (local or cloud)
  • WAV audio export
  • TXT transcript export

Pro

$9.99/mo

$79.99/year · €99.99 lifetime

  • Unlimited recording duration
  • System Audio & Mixed recording
  • M4A, AAC, Markdown, RTF, JSON, PDF export
  • Unlimited AI enhancements (local or cloud with your key)*
  • Large transcription model
  • Remove branding from exports and shares

Free trial included with subscription

*Cloud AI features (OpenAI, Anthropic, Google) may not be available in all countries due to local regulations. On-device AI is available everywhere.

𓇳 Coming in the next update

The free tier is getting more generous.

  • Unlimited recordings, no lifetime cap
  • 10 AI actions per month (up from 3)

The Scribe Awaits

Private by default.
No cloud required.

Accurate transcripts and AI insights, running entirely on your hardware. No accounts, no uploads, no data leaving your Mac.

Download on the Mac App Store