Drop a long episode ship clips that finish their thought.
Whipscribe picks moments with a setup, a turn, and a payoff — not 30-second laugh clips. Every clip comes back in four aspect ratios with burned-in captions and an AI-suggested title for each platform.
What you get
What a real podcast clip needs to land.
A complete thought
Most clip tools cut on volume spikes — you get clips that start mid-sentence and end on a half-laugh. Our model scores setup-turn-payoff structure and only picks clips a viewer can follow without context.
Multi-speaker split-screen
Two-host episodes get a vertical split-screen with auto-zoom on whoever is talking. Host on top, guest on bottom; the camera switches with the conversation. No manual editing required.
Every aspect ratio at once
9:16 for TikTok, Reels, Shorts. 1:1 for X and the LinkedIn feed. 4:5 for the LinkedIn carousel-adjacent slot. 16:9 for YouTube embeds. One drop, four exports — no re-render, no second job.
AI-named titles per platform
TikTok gets one hook style; LinkedIn gets a different framing; Shorts gets a third. Every clip ships with a suggested title for each surface — burned into the upload, ready to copy into the platform field.
Why story-arc selection beats hot-moment selection
Random hot-moment clips vs story-arc clips.
✗ Hot-moment clip tools
Cut on volume spikes, laughs, or keyword density. Result: clips that start with 'and so I told him —' and end mid-laugh. Viewers bounce in three seconds.
- Cuts mid-sentence, no setup
- No context — viewer drops in 3 sec
- Single aspect ratio per render
- Generic 'Funny Moment' titles
- No multi-speaker handling
✓ Whipscribe story-arc clips
Each clip has a setup line, a turn, and a payoff. The viewer enters with context and leaves having heard a complete thought — which is what makes a clip share-worthy.
- Setup → turn → payoff structure
- Burned-in word-level captions
- All four aspect ratios per clip
- Per-platform AI titles
- Split-screen with auto-zoom on speaker
Sample output
Speaker-labelled. Click-to-seek. Cut into clips.
Same engine that generates the transcript also feeds the clip selector — moments are picked from the diarized text, not the waveform.
Export
One transcript. Five clean formats.
Every paid tier exports all five. The free tier exports TXT and SRT.
Plain text
De-ummed paragraphs. Ready to paste.
SRT captions
Word-level. Every video editor reads this.
WebVTT
HTML5 player + YouTube uploads.
Show notes
Formatted with chapters and pull-quotes.
Machine-readable
Per-word timing + speaker IDs.
Pricing
Honest pricing, no surprises.
Credits never expire. Upgrade or downgrade any month. Free tier resets daily — no signup, no card.
Free
$0/forever
Try every feature for 30 minutes a day. No card.
- 30 min / day
- Speaker labels included
- TXT + SRT export
- No history retention
Pay-as-you-go
$1/hour
Best for one-off projects. Credits never expire.
- $10 minimum top-up
- Every export format
- 365-day history
- API access
Pro
$8/month
Indie creators. 100 hours / month, all features.
- 100 hours / month
- Clips + every aspect ratio
- Branded captions
- Priority queue
Team
$29/month
Teams. 500 hours / month, shared workspace.
- 500 hours / month
- Shared library
- API + MCP for Claude
- Workspace billing
FAQ
Podcast clip maker questions, answered.
How does it handle multi-speaker clips?
Two-host or host-plus-guest episodes get a vertical split-screen render — host on top half, guest on bottom — with auto-zoom on whoever is currently speaking. The diarization model decides the cuts; you don't manually mark speakers.
What is 'story-arc' clip selection?
Our picker scores each candidate clip on whether it has a setup line, a turn (the surprise or pivot), and a payoff (the resolution). Clips without all three are filtered out. The result is clips a stranger can follow without context — which is the actual bar for shareability.
Do clips come with AI-named titles?
Yes. Each clip ships with a suggested title for TikTok, Shorts, LinkedIn, and X — three to four variations, hook-style, written from the actual content of the clip. Copy the one you want into the upload field.
Is there a Whipscribe watermark on the clips?
On the free tier — yes, a small bottom-right corner mark. On Pro and Team — no watermark. The captions, ratio, and content are identical across tiers; the only difference is the corner mark.
Can I add my own brand colors and logo to captions?
Yes on Pro and Team. Upload a logo PNG, set primary and accent caption colors, and pick a font from a curated list. All future clips render with that brand pack. Free tier uses the default Whipscribe caption style.
How many clips do I get from a one-hour episode?
Typically 5 to 9 high-quality clips, depending on how dense the conversation is. The picker is conservative — we'd rather give you 5 strong clips than 30 mediocre ones. You can re-run the selection with different settings if you want more or fewer.
Related
Related tools and pages.
Drop one episode. Get a week of clips.
Try WhipscribeOperated by Neugence Technology Pvt. Ltd. · contact@neugence.ai · Security · Privacy · Terms