Thoth

Transcribe · Summarize · Zero Upload

Your Private AI Scribe

Record both sides of any meeting. Transcribe and summarize locally on your Mac. No cloud, no data ever leaving your machine.

Requires macOS Tahoe · Apple Silicon recommended · Free to try

Built for Mac

Native SwiftUI

Real macOS app. Collapsible sidebars, smooth animations, proper Dark Mode. Runs on AppKit, not Electron.

Export Anywhere

PDF, Markdown, RTF, or JSON with timestamps and color-coded speakers. Share or archive however you want.

Batch Import

Drop in voice notes, lectures, or old meeting files. Thoth processes them in batch on your Apple Silicon chip.

Absolute Privacy

Under the Scribe's Seal.

Your data stays yours. Built from day one for people who handle confidential information.

100% Offline Engine

Transcription runs on your Mac through WhisperKit and CoreML. No audio ever leaves your machine.

Local Speaker Detection

An on-device engine detects who is speaking and color-codes your transcript. No cloud processing involved.

On-Device AI Summaries

Choose from five local models (1.9 GB to 6.8 GB) and run AI summaries entirely on your Mac. No API key, no cloud.

Bring Your Own Key

Prefer OpenAI, Anthropic, or Google? Use your own API keys. Requests go direct from your Mac to the provider.*

Keychain Secured

API credentials stay locked in your Apple Keychain. Thoth never stores keys in plaintext or sends them anywhere.

Under the Hood

Real numbers, real hardware.

Benchmarked on a 42-minute recording, Apple M2 MacBook Pro.

Transcription Performance

  • 3.3 min to transcribe 42 min of audio (12.7× realtime)
  • All processing on Apple Neural Engine via CoreML
  • 99 languages with auto-detection

Large V3 Turbo model
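
For readers who want to check the realtime figure, here is a minimal sketch of the arithmetic. The function name `realtime_factor` is made up for this illustration and is not part of Thoth; the inputs are the two benchmark numbers quoted above.

```python
def realtime_factor(audio_minutes: float, processing_minutes: float) -> float:
    """Return how many minutes of audio are processed per minute of compute."""
    if processing_minutes <= 0:
        raise ValueError("processing_minutes must be positive")
    return audio_minutes / processing_minutes


# Benchmark figures from the list above: 42 min of audio in 3.3 min.
factor = realtime_factor(42, 3.3)
print(f"{factor:.1f}x realtime")  # prints "12.7x realtime"
```

The same ratio lets you estimate processing time for your own recordings on comparable hardware, with the usual caveat that results vary by machine and audio.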

Diarization Performance

  • 7.72 seconds for 42 min of audio with 2 speakers
  • Up to 8 speakers supported
  • Mixed audio (Zoom, Teams, Meet): attribution is deterministic because mic and system audio are captured as separate streams

Fully on-device via Pyannote CoreML
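
The mixed-audio point above can be illustrated with a small sketch. It is not Thoth's implementation: the `Segment` type, the `attribute_by_stream` helper, and the "You" / "Remote" labels are all assumptions made for this example. The idea is only that when each stream is transcribed separately, the stream a segment came from already tells you which side of the call spoke.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the start of the recording
    end: float
    text: str


def attribute_by_stream(mic, system):
    """Merge two per-stream segment lists into one timeline labelled by source.

    Hypothetical labels: "You" for the microphone stream, "Remote" for
    system audio. No acoustic speaker matching is needed for this split.
    """
    labelled = [("You", s) for s in mic] + [("Remote", s) for s in system]
    return sorted(labelled, key=lambda pair: pair[1].start)


# Made-up sample segments, one list per stream.
mic = [Segment(0.0, 2.5, "Shall we start?"), Segment(6.0, 8.0, "Sounds good.")]
system = [Segment(3.0, 5.5, "Yes, I have the agenda.")]

for speaker, seg in attribute_by_stream(mic, system):
    print(f"[{seg.start:5.1f}s] {speaker}: {seg.text}")
```

Telling several remote participants apart within the system-audio stream is a separate problem, which is where diarization comes in.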

Local vs Cloud AI Summaries

Tested on a real French-language interview transcript. Scored by Claude Opus across 6 criteria.

Local — Qwen 7B

~5/10

Fast and private. Good for quick overviews. Struggles with nuanced decision capture and quote selection.

Cloud — Claude Sonnet (BYOK)

~8.7/10

Captures operational detail, adapts to content type, better quote selection.

Criterion            Local (Qwen 7B)     Cloud (Claude Sonnet)
Factual accuracy     7/10                9.5/10
Completeness         5/10                9/10
Decision capture     2/10                8.5/10
Action items         5/10                8/10
Quote selection      4/10                8.5/10
Language quality     7/10                9/10
Overall              ~5/10               ~8.7/10
Privacy              Zero data leaves    Text sent to provider
Cost                 Free                ~$0.01/hour
Internet required    No                  Yes
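
The overall scores are consistent with a plain average of the six criteria. The page does not say how the overall figure was computed, so the unweighted mean below is an assumption; the per-criterion numbers are copied from the comparison above.

```python
# Per-criterion scores out of 10: (local Qwen 7B, cloud Claude Sonnet).
scores = {
    "Factual accuracy": (7, 9.5),
    "Completeness": (5, 9),
    "Decision capture": (2, 8.5),
    "Action items": (5, 8),
    "Quote selection": (4, 8.5),
    "Language quality": (7, 9),
}

# Assumption: the overall score is an unweighted mean of the six criteria.
local_overall = sum(local for local, _ in scores.values()) / len(scores)
cloud_overall = sum(cloud for _, cloud in scores.values()) / len(scores)

print(f"Local overall: {local_overall:.2f}/10")  # prints "Local overall: 5.00/10"
print(f"Cloud overall: {cloud_overall:.2f}/10")  # prints "Cloud overall: 8.75/10"
```

Under that assumption the local model averages 5.00 and the cloud model 8.75, which lines up with the rounded "~5" and "~8.7" figures shown.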

Honest takeaway: local models are best when privacy is non-negotiable or internet is unavailable. Cloud AI is better when depth matters.

The privacy guarantee stays constant regardless of which you choose. Audio never leaves your machine. If you use BYOK cloud AI, only the transcript text goes to your chosen provider, directly with your key. Thoth never sees it.

Pricing

Try free, then go Pro.

Start free with core transcription features. Upgrade for unlimited recordings, the large transcription model, and unlimited AI enhancements.

Free

$0

  • 5 recordings
  • 30 min (mic) / 15 min (system audio)
  • 3 AI enhancements/month (local or cloud)
  • WAV audio export
  • TXT transcript export

Pro

$9.99/mo

$79.99/year · $149.99 lifetime

  • Unlimited recordings & duration
  • System Audio & Mixed recording
  • M4A, AAC, Markdown, RTF, JSON, PDF export
  • Unlimited AI enhancements (local or cloud with your key)*
  • Large transcription model

Free trial included with subscription
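
To compare the three Pro options, a quick back-of-the-envelope sketch may help. It uses only the list prices shown above, works in integer cents to avoid rounding surprises, and ignores taxes, regional pricing, and the free trial, none of which are specified here.

```python
# List prices from the Pro tier above, in cents.
MONTHLY_CENTS = 999
YEARLY_CENTS = 7999
LIFETIME_CENTS = 14999


def dollars(cents: int) -> str:
    """Format an integer number of cents as a dollar amount."""
    return f"${cents // 100}.{cents % 100:02d}"


# What twelve monthly payments cost compared with one yearly payment.
yearly_saving = 12 * MONTHLY_CENTS - YEARLY_CENTS

# First billing period in which cumulative payments exceed the lifetime price
# (ceiling division; neither price divides the lifetime price evenly).
months_to_pass_lifetime = -(-LIFETIME_CENTS // MONTHLY_CENTS)
years_to_pass_lifetime = -(-LIFETIME_CENTS // YEARLY_CENTS)

print(f"Yearly saves {dollars(yearly_saving)} over 12 monthly payments")
print(f"Monthly billing passes the lifetime price at month {months_to_pass_lifetime}")
print(f"Yearly billing passes the lifetime price in year {years_to_pass_lifetime}")
```

In short: the yearly plan saves $39.89 over paying monthly for a year, and the lifetime option costs less than monthly billing from month 16 and less than yearly billing from the second year.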

*Cloud AI features (OpenAI, Anthropic, Google) may not be available in all countries due to local regulations. On-device AI is available everywhere.

The Scribe Awaits

Private by default.
No cloud required.

Accurate transcripts and AI insights, running entirely on your hardware. No accounts, no uploads, no data leaving your Mac.

Download on the Mac App Store