99languages
Multilingual foundation
Whisper is built for automatic speech recognition and translation to English output.
Privacy-first dictation for Mac
KimiTalk is the private dictation app for your Mac.
No cloud upload for recognition. Recognition runs locally.
Currently v0.9.19 · See all changes →
Time saved calculator
Measure or estimate your typing speed and see how much time speech input can save compared with desktop keyboard entry. The baseline values are backed by published keyboard and speech-rate research.
Your typing speed
Estimate it, enter it, or measure it in a short test.Type freely for 30 seconds. The estimate uses the common WPM standard: 5 characters count as 1 word.
Workflow presets
Baseline: 52 words/min typing + 150 words/min speech
Estimated saved time
2.1 hrs saved per weekThe longer the text gets, the more speaking starts to pull ahead of typing.
Your advantage
2.9× faster Speech compared with your typingWrite anywhere
KimiTalk places text into the active text field: Mail, Notes, browsers, Slack, Notion, VS Code, ChatGPT, Claude, Cursor, and similar Mac apps. AI tools are destination apps, not a requirement. KimiTalk does not upload raw audio for recognition; text you paste or send in cloud apps follows the rules of that destination app.
You say
Hi everyone thanks for the update next let's clarify the open points review the changes and agree on the release date
KimiTalk writes
KimiTalk works in the background
Why it matters
Focus
Trust The study by Vimalkumar et al. shows that trust helps decide whether people use speech systems. KimiTalk strengthens that trust because recognition runs locally on your Mac.System recommendation
Dictation is always the base. Once you add rewriting or translation, your Mac needs more reserve because Gemma 4 runs locally alongside Whisper.
KimiTalk chooses the right model locally on your Mac when the app starts. This website does not probe your hardware automatically and stores nothing here.
What do you want to do?
Choose what you want to add to local dictation.Recommended baseline
Dictation always stays local. Gemma 4 is added only when you use local text features or translation.
Model basis
The app chooses the variant that fits your device, so recognition can stay local and smooth.
99languages
Whisper is built for automatic speech recognition and translation to English output.
32→4decoder layers
Large-v3-Turbo greatly reduces the decoder stack, making it optimized for fast local transcription.
16 GBrecommended
For smooth dictation while other apps are running, we recommend Apple silicon with 16 GB RAM or more.
Sources: OpenAI Whisper, large-v3-turbo model card.
Basic vs. Pro
Basic covers local dictation and file transcription. Pro adds history, speaker-aware transcripts, long-form capture, and local tools for shaping text after you speak.
Basic
For users who want private speech-to-text, app switching, mixed-language dictation, file transcription, and cleanup.
€24.90 lifetime licensePro
For users who turn spoken thoughts and long-form audio into structured transcripts, projects, documents, messages, and prompts when needed.
from €6.99/monthTranslation to English output requires a Large-v3 model with Whisper Translate support, such as quantized Large-v3 (~626 MB). Turbo models are optimized for fast transcription and do not support this Translate mode.
Buying options
Checkout is not live yet. These buttons will redirect to Lemon Squeezy once payments are available.
€24.90 lifetime license
Local dictation, cleanup style, and updates until the next major version.
€6.99 /month
Basic plus history, speaker-aware transcripts, long-form capture, local text tools, and all future Pro features.
€99 one-time
Pay once and keep Pro. Usually pays for itself after roughly two years.
Good to know
The essentials about trial, Pro, licensing, and local work.
Yes. After onboarding, you can try KimiTalk with 1,000 free words. Your settings remain available afterwards; you can keep dictating as soon as you activate a license.
Pro is for rough dictation, files, and longer recordings that should become cleaner, more structured, or better suited to their context. It includes history, speaker-aware transcripts, long-form capture, local text tools, and future Pro features.
One license covers up to three Macs. KimiTalk validates signed license tokens locally and only refreshes them occasionally online.
Recognition runs on your Mac with the downloaded model. Raw audio is not uploaded for recognition. If you paste or send text into cloud apps like ChatGPT, Claude, or Cursor, that text follows the rules of the destination app.