Transcribe · Summarize · Zero Upload
Your Private AI Scribe
Record both sides of any meeting. Transcribe and summarize locally on your Mac. No cloud, no data ever leaving your machine.
Requires macOS Tahoe · Apple Silicon recommended · Free to try
The Scribe's Craft












Capture Both Sides of the Conversation
Record your microphone and system audio at the same time. No third-party routing tools, no virtual audio drivers. Just hit record.
Built for Mac
Native SwiftUI
Real macOS app. Collapsible sidebars, smooth animations, proper Dark Mode. Runs on AppKit, not Electron.
Export & Share
PDF, Markdown, RTF, or JSON with timestamps and color-coded speakers. Or share any transcript or summary directly to Mail, Messages, and AirDrop in one tap.
99 Languages
Auto-detects the language and transcribes it. English, French, Japanese, Arabic: Whisper handles them all, entirely on your Mac.
Audio Architecture
Two channels.
Zero ambiguity.
Most tools blend your mic and remote audio into a single stream, then guess who spoke when. Thoth keeps them separate from the very first sample. No guesswork needed.
Channel 1 · Mic
Your Voice
Dedicated stream from your microphone. Captured independently and identified as you before transcription starts. Never mixed with remote audio.
Always Speaker 1Channel 2 · System
Remote Voices
Captured from Zoom, Teams, Meet or any app via the macOS audio engine. A completely independent stream. No screen capture, no bot joining the call.
Speaker 2 and beyondNo guesswork. Every word attributed correctly.
When channels are separate, speaker attribution for remote participants is deterministic, not estimated. Mic is always you. System audio is always them. Even when everyone talks at once.
Absolute Privacy
Under the Scribe's Seal.
Your data stays yours. Built from day one for people who handle confidential information.
100% Offline Engine
Transcription runs on your Mac through WhisperKit and CoreML. No audio ever leaves your machine.
Local Speaker Detection
An on-device engine detects who is speaking and color-codes your transcript. No cloud processing involved.
On-Device AI Summaries
Choose from five local models (1.9 GB to 6.8 GB) and run AI summaries entirely on your Mac. No API key, no cloud.
Bring Your Own Key
Prefer OpenAI, Anthropic, or Google? Use your own API keys. Requests go direct from your Mac to the provider.*
Keychain Secured
API credentials stay locked in your Apple Keychain. Thoth never stores keys in plaintext or sends them anywhere.
Under the Cartouche
Real numbers, real hardware.
Benchmarked on a 42-minute recording, Apple M2 MacBook Pro. How we measure this.
Re-transcription · 42-min recording, M2 MacBook Pro
| Model | Speed | WER · accented EN | WER · clean EN | Languages |
|---|---|---|---|---|
| Whisper Large V3 Turbo | 12.7× | 32.3% | 7.8% | 99 |
| Parakeet TDT V3 Pro | 180× | 38.9% | 8.7% | 25 |
| Whisper Small | 17.7× | 45.9% | 9.2% | 99 |
| Whisper Base | 59.6× | 51.1% | 9.2% | 99 |
Live transcription · Script A, French-accented English
| Engine | WER | Latency |
|---|---|---|
| Parakeet EOU 120M Pro | 38.4% | ~160 ms |
| Parakeet Sliding Window Pro | 56.8% | ~11 s |
| WhisperKit Base+Small | 65.9% | ~12 s |
Large V3 Turbo wins on accuracy. Best on all three scripts: 32.3% on French-accented English, 7.8% on clean audio. If the transcript needs to be right, this is the one.
Parakeet is 14x faster on the same file. Near-identical WER on clean speech (8.7% vs 7.8%). Falls behind on accented speech and code-switching. Worth it when speed matters and audio is clean.
Parakeet EOU is a different category. Word-by-word output at ~160 ms latency. Comparing its 38.4% WER to batch models isn't fair: it's a streaming engine optimised for real-time, not accuracy.
Published benchmarks are optimistic. Every model ran 10-30x worse on accented or foreign-language speech than studio numbers suggest. Real meetings are harder than LibriSpeech.
Local vs Cloud AI Summaries
Tested on a real French-language interview transcript. Scored by Claude Opus across 6 criteria.
Local · Qwen 7B
~5/10
Fast and private. Good for quick overviews. Struggles with nuanced decision capture and quote selection.
Cloud · Claude Sonnet (BYOK)
~8.7/10
Captures operational detail, adapts to content type, better quote selection.
| Local (Qwen 7B) | Cloud (Claude Sonnet) | |
|---|---|---|
| Factual accuracy | 7/10 | 9.5/10 |
| Completeness | 5/10 | 9/10 |
| Decision capture | 2/10 | 8.5/10 |
| Action items | 5/10 | 8/10 |
| Quote selection | 4/10 | 8.5/10 |
| Language quality | 7/10 | 9/10 |
| Overall | ~5/10 | ~8.7/10 |
| Privacy | Zero data leaves | Text sent to provider |
| Cost | Free | ~$0.01/hour |
| Internet required | No | Yes |
Honest takeaway: local models are best when privacy is non-negotiable or internet is unavailable. Cloud AI is better when depth matters.
The privacy guarantee stays constant regardless of which you choose. Audio never leaves your machine. If you use BYOK cloud AI, only the transcript text goes to your chosen provider, directly with your key. Thoth never sees it.
How We Compare
Local-first. Always.
Cloud recorders are convenient. They're also always listening.
| Thoth | Otter | Fireflies | Granola | |
|---|---|---|---|---|
| Audio stays on your Mac | ✓ | ✗ | ✗ | ✗ |
| No bot joins your call | ✓ | ✗ | ✗ | ✓ |
| Works fully offline | ✓ | ✗ | ✗ | ✗ |
| Dual-channel recording | ✓ | ✗ | ✗ | ✗ |
| On-device AI summaries | ✓ | ✗ | ✗ | ✗ |
| Native Mac app | ✓ | ✗ | ✗ | ✓ |
Competitor features are approximate and subject to change. Otter, Fireflies, and Granola are trademarks of their respective owners.
Pricing
Try free, then go Pro.
Start free with full transcription features. Upgrade for unlimited duration and AI.
Free
$0
- Unlimited recordings
- 30 min (mic) / 15 min (system audio)
- 10 AI enhancements/month (local or cloud)
- WAV audio export
- TXT transcript export
Pro
- Unlimited recording duration
- System Audio & Mixed recording
- M4A, AAC, Markdown, RTF, JSON, PDF export
- Unlimited AI enhancements (local or cloud with your key)*
- Large transcription model
- Remove branding from exports and shares
Free trial included with subscription
*Cloud AI features (OpenAI, Anthropic, Google) may not be available in all countries due to local regulations. On-device AI is available everywhere.
𓇳 Coming in the next update
The free tier is getting more generous.
- Unlimited recordings, no lifetime cap
- 10 AI actions per month (up from 3)