100% LOCAL VOICE DICTATION · WINDOWS
Speak. It types.
Nothing ever leaves your PC.
Hold a key, talk, release — your words appear in whatever app you're using. Whisper AI runs on your own machine: no cloud, no accounts, no subscription, no one listening.
▲ THE HUD — LIVE WAVEFORM, THEN PROOF OF WHAT IT HEARD
Types anywhere
Email, chat, docs, code, games — if your cursor blinks there, REMsound types there. Hold Right Ctrl or go hands-free with F9.
Voice commands
"scratch that" erases your last dictation. "search for anything" Googles it. Build your own commands to open apps, press keys, run anything.
Private by physics
OpenAI's Whisper model runs on your CPU/GPU. Audio is transcribed in RAM and never uploaded — there's no server to send it to.
THE GUIDE
From zip to talking in about a minute
- Download & unzip: Grab the zip with the button above, then right-click → Extract All… and put it anywhere you like.
- Double-click INSTALL.bat: Copies REMsound to your user folder and puts icons on your Desktop and Start Menu. Prefer portable? Skip this step; REMsound.exe runs straight from the folder.
- Launch REMsound: A teal microphone appears in your system tray and the HUD flashes “ONLINE — hold [right ctrl] to dictate”. The first dictation downloads the speech model (~75 MB) once; after that it's fully offline.
Hold Right Ctrl, speak, release. That's the whole skill. Tap F9 instead for hands-free mode — talk as long as you like, tap again to finish.
While you speak: cyan, with the waveform dancing to your voice. Flat bars = check your mic.
While it transcribes: an amber sweep as speech becomes text. Usually under a second.
When it's done: the HUD shows exactly what it typed. Your proof-of-delivery, every time.
Changed your mind mid-sentence? Click the ✕ on the pill — cancelled, nothing typed.
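For the curious, the hold-to-talk and tap-to-toggle flow above can be pictured as a small state machine. This is only an illustrative sketch: the state names, the `Dictation` class and the key strings are made up for the example and are not REMsound's actual code.

```python
from enum import Enum


class Hud(Enum):
    IDLE = "idle"
    LISTENING = "listening"        # cyan waveform
    TRANSCRIBING = "transcribing"  # amber sweep
    TYPED = "typed"                # HUD shows what was typed


class Dictation:
    """Tracks HUD state for push-to-talk (hold) and hands-free (toggle) input."""

    def __init__(self):
        self.state = Hud.IDLE
        self.hands_free = False

    def key_down(self, key):
        if self.state in (Hud.IDLE, Hud.TYPED):
            if key in ("right ctrl", "f9"):
                self.hands_free = key == "f9"
                self.state = Hud.LISTENING
        elif self.state is Hud.LISTENING and key == "f9" and self.hands_free:
            self.state = Hud.TRANSCRIBING  # second F9 tap ends hands-free mode

    def key_up(self, key):
        if key == "right ctrl" and self.state is Hud.LISTENING and not self.hands_free:
            self.state = Hud.TRANSCRIBING  # releasing the key ends push-to-talk

    def cancel(self):
        """The X on the pill: drop the audio and type nothing."""
        self.state = Hud.IDLE

    def transcribed(self):
        if self.state is Hud.TRANSCRIBING:
            self.state = Hud.TYPED


d = Dictation()
d.key_down("right ctrl")   # hold
d.key_up("right ctrl")     # release
d.transcribed()
print(d.state.value)       # typed
```

Key repeats while Right Ctrl is held are ignored in this sketch, because a second `key_down` in the listening state only reacts to F9.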
Then try saying (on their own): “scratch that” · “select all” · “press enter” · “open notepad” · “search for northern lights tonight” — and inside a sentence: “new line”, “new paragraph”, “smiley face”.
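The difference between "on their own" and "inside a sentence" can be sketched roughly like this. The phrase tables, the `classify` function and the `:)` output for “smiley face” are assumptions for illustration; the real app's mappings may differ.

```python
import re

# Hypothetical phrase tables, not REMsound's real configuration.
INLINE = {
    "new paragraph": "\n\n",
    "new line": "\n",
    "smiley face": ":)",
}
STANDALONE = {"scratch that", "select all", "press enter"}


def classify(utterance):
    """Return ("command", phrase) for a whole-utterance command, else ("text", rewritten)."""
    spoken = utterance.strip().lower().rstrip(".!?")
    if spoken in STANDALONE:
        return ("command", spoken)
    text = utterance
    for phrase, replacement in INLINE.items():
        # Swallow the spaces (and a trailing comma or period) around the spoken phrase.
        pattern = re.compile(r"\s*\b" + re.escape(phrase) + r"\b[,.]?\s*", re.IGNORECASE)
        text = pattern.sub(replacement, text)
    return ("text", text)


print(classify("Scratch that."))                # ('command', 'scratch that')
print(classify("Please scratch that idea"))     # ('text', 'Please scratch that idea')
print(repr(classify("Hello new line world")[1]))  # 'Hello\nworld'
```

Matching the whole utterance for commands is what lets "scratch that" appear safely inside an ordinary sentence.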
Everything happens on your machine. REMsound uses the open-source Whisper speech model via faster-whisper, running on your own processor. Your voice is captured, transcribed in memory, typed, and gone.
- One network request, ever: the first run downloads the speech model from Hugging Face. After that, airplane mode works fine.
- No accounts, no telemetry, no analytics — there is literally no server.
- Your dictation history is a plain text file on your disk, for you alone. One toggle turns it off.
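If you want to see what local transcription with faster-whisper looks like in general, here is a minimal sketch. It is not REMsound's source: the `tiny` model size, the CPU/int8 settings and both function names are assumptions chosen for the example. The audio is passed as an in-memory NumPy array, so no audio file is written.

```python
from types import SimpleNamespace  # only used to fake segments in the demo below


def join_segments(segments):
    """Concatenate segment texts into the single string that would be typed."""
    return " ".join(s.text.strip() for s in segments).strip()


def transcribe_in_memory(samples, model_size="tiny"):
    """Transcribe 16 kHz mono float32 samples (a NumPy array) held in RAM.

    model_size, device and compute_type are assumptions, not REMsound's settings.
    """
    # Imported lazily so the rest of this sketch runs without the package installed.
    from faster_whisper import WhisperModel

    # The first call fetches the model from Hugging Face; later calls reuse the local cache.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(samples, language="en")
    return join_segments(segments)


# Demo with fake segments, so no model download is needed to try the helper.
fake = [SimpleNamespace(text=" Hello"), SimpleNamespace(text=" world.")]
print(join_segments(fake))  # Hello world.
```

`WhisperModel` also accepts `local_files_only=True`, which refuses any download and is one way to confirm that a cached model really does work offline.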
CONTROL DECK
Every single behaviour is yours to rewire
REMsound ships with its own settings app — no config files needed. Change anything, hit Save & Apply, and the running app reloads instantly.

- Hotkeys — any key or combo for push-to-talk and hands-free.
- Your own voice commands — open apps and URLs, press key combos, run anything, with wildcard phrases.
- Text shortcuts — say “my address”, get your address.
- Per-app styles — lowercase in Discord, code-friendly in VS Code, automatically.
- The AI itself — five model sizes, from instant to studio-grade; teach it names it misspells.
- Neon theming — recolor the HUD to any glow you like.
- History — searchable log of everything you've dictated, kept on your disk.
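To give a feel for wildcard phrases and text shortcuts, here is a rough sketch of how such rules could be matched. Everything in it is hypothetical: the `COMMANDS` and `SHORTCUTS` tables, the action names, the example address and the Google URL template are illustrations, not REMsound's settings format.

```python
import re
from urllib.parse import quote_plus

# Hypothetical user rules: "*" captures whatever is said after the fixed words.
COMMANDS = {
    "open *": ("launch", "{0}"),
    "search for *": ("open_url", "https://www.google.com/search?q={0}"),
}
SHORTCUTS = {"my address": "123 Example Street"}


def compile_phrase(phrase):
    """Turn 'search for *' into a regex where each * captures one or more characters."""
    parts = [re.escape(p) for p in phrase.split("*")]
    return re.compile("^" + "(.+)".join(parts) + "$", re.IGNORECASE)


def resolve(utterance):
    """Map an utterance to (action, payload); plain speech falls through to ("type", text)."""
    spoken = utterance.strip().rstrip(".!?")
    if spoken.lower() in SHORTCUTS:
        return ("type", SHORTCUTS[spoken.lower()])
    for phrase, (action, template) in COMMANDS.items():
        match = compile_phrase(phrase).match(spoken)
        if match:
            arg = match.group(1)
            if action == "open_url":
                arg = quote_plus(arg)  # make the captured words safe for a URL
            return (action, template.format(arg))
    return ("type", utterance)


print(resolve("open notepad"))   # ('launch', 'notepad')
print(resolve("search for northern lights tonight"))
# ('open_url', 'https://www.google.com/search?q=northern+lights+tonight')
print(resolve("My address."))    # ('type', '123 Example Street')
```

Anything that matches no rule is simply typed as dictated, which keeps ordinary sentences from triggering commands by accident.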
Give your keyboard a rest.
One zip. No installer wizards, no accounts, no cloud.
⬇ DOWNLOAD REMSOUND FOR WINDOWS