Paste an episode URL. Get a speaker-labelled transcript, an SRT ready for TikTok / Reels, and a DOCX show-notes block. First episode on us — no signup.
Built for the indie podcaster workflow
We fix them.
Not "Speaker 1 / Speaker 2" guesswork. We diarize 2–8 speakers cleanly — guest interviews, co-host pods, panels. You fix the names once; the labels stay consistent through the episode.
Caption cues at the right length — not 30-word walls of text. Drop the SRT straight into CapCut, Descript, Premiere, or Riverside. Word-level timestamps included if you need to re-cut.
DOCX export includes a chapter-markers draft, key quotes pulled from the transcript, and timestamps linking back. Paste straight into Substack, Ghost, WordPress, Apple Podcasts.
Sample output
Speaker-labelled paragraphs, clickable timestamps, ready to paste into show notes.
Every format you'd ask for
Clean, de-umm'd paragraphs. Drop into a blog post.
Social-clip-ready cues. Works with every editor.
HTML5 player captions + YouTube uploads.
Formatted with chapters + quote pulls for publishing.
New · clipping for podcasters
Drop your full episode. Get TikTok / Reels / Shorts / LinkedIn-square clips back, with the right speaker on screen and word-level captions burned in. Same transcript, more outputs.
When two or three guests are talking, the clip shows them side by side. Per-speaker single-crop alternates ship in the same drop.
The camera tracks who's actually talking. Cuts happen on speaker change, not mid-sentence. No manual reframe.
Whipscribe reads the episode end-to-end and picks moments that trace a beat — problem → tension → resolution — instead of just the loudest 30 seconds.
9:16 (TikTok / Reels / Shorts), 1:1 (LinkedIn / IG / X), 4:5 (IG feed Meta-preferred), 16:9 (YouTube). Drop once, render everywhere.
Anything else
Word-level accuracy on clean 2-speaker podcast audio (Riverside / SquadCast / local recordings) is typically under 5% Word Error Rate. Heavy accents, music beds, and noisy field recordings hit harder — run a free trial to see for yourself.
Tested up to 8. Three-person panels and interview formats work cleanly. The first minute is usually enough to stabilise speaker identity across the episode.
You can also upload directly at whipscribe.com — mp3, m4a, wav, flac, webm, mov, mp4 all work. Up to 2GB per file on the free tier.
No. Your audio is transcribed and returned to you. We don't fine-tune on user content, don't ship it to third parties, and paid tiers get 365-day retention you can revoke at any time.
Not to try it. The first episode — up to 20 minutes of audio — is free to transcribe without signup. Sign in if you want retention, history, and API access.
A typical weekly podcast runs ~3-4 hours/month of audio. See the pricing page — Pro tier covers most indie creators; pay-as-you-go for one-off needs.
Send your podcast email + a link to your show. We'll email you back within 48 hours with 30 hours of credits loaded.
First episode's on us. No card, no signup, no trial timer — paste an audio URL at the top and we'll transcribe it.
Transcribe my episode →