The Make Calcutta Relevant Again (MCRA) Podcast started as an idea — a group of Kolkata creatives who wanted to tell their city's stories without waiting for permission from anywhere else. When the founders approached me to join as their lead video editor and cinematographer, the podcast had around 10,000 followers. Real conversations. Good energy. But the channel wasn't growing the way the content deserved.
Over the next year and a half, I built a short-form video editing and content strategy from the ground up — engineering hooks, systematising clip selection, building a repurposing workflow, and iterating on performance data weekly. The result: the channel grew from 10,000 to 25,000 followers, and total views crossed 10 million.
This is exactly what I did. Not theory — the actual framework I used as a working video editor in Kolkata who was accountable for every frame of this project.
The Problem Most Podcast Video Editors Get Wrong
When most video editors work on podcasts, they think their job ends at colour grading the long-form episode and uploading it to YouTube. They deliver the full cut and move on. That approach leaves the most powerful growth lever completely unused.
Podcast video editing done right is a two-part job: producing the long-form episode, and engineering the short-form clips that drive discovery. The long-form is the source material. The real growth engine — especially on Instagram Reels, YouTube Shorts, and TikTok — is short-form content built from that source material.
Short-form video editing is a completely different discipline. A 45-minute podcast has the luxury of a slow build. A 60-second Reel has exactly 1.5 seconds to earn the next 59. This is where most podcast producers fail — they either don't clip at all, or they clip poorly, choosing segments that feel interesting in context but make no sense as standalone content.
A great short-form video editor doesn't just find the best moments from a podcast. They reconstruct those moments into self-contained pieces of content with their own hook, arc, and purpose.
Building the MCRA Short-Form Video Strategy
My first move was to establish a consistent production framework. For every full MCRA episode, I committed to producing:
- 1–2 horizontal long-form edits for YouTube (full episodes, colour graded, sound designed)
- 4–6 short-form vertical clips for Instagram Reels and YouTube Shorts
- 1 high-CTR thumbnail per episode
- Periodic TikTok versions adapted for that platform's native format
The short-form clips were not random cuts from the conversation. Each one was engineered as a standalone piece of content. Before I opened Premiere Pro, I had already identified the theme, the hook, and the intended viewer reaction for each clip. Strategy first. Editing second.
Hook Engineering: The First 3 Seconds of Every Clip
The most important edit in any piece of short-form video is the first 2–3 seconds. This is where you win or lose your viewer. I developed three hook types for MCRA, rotating them based on content and testing performance weekly:
1. The Statement Hook
A bold, counter-intuitive statement from the guest or host — delivered before any context or introduction. A clip that opens with "Kolkata is the most underestimated creative city in the world" before we see who's speaking. The statement creates curiosity. The edit creates urgency.
2. The Question Hook
A question the viewer feels they need answered. For MCRA, questions about Kolkata's identity and its relationship with Indian pop culture consistently drove the highest retention. The key is specificity — generic questions perform poorly; pointed ones that signal a real answer is coming perform very well.
3. The Visual Hook
A moment of visible emotion — a laugh, a pause, a strong reaction — used as the first frame before any context. No talking head, no title card. Just a human moment that compels the viewer to understand what caused it. Highly effective for clips in the 90–180 second range.
The Editing Techniques That Drove Retention
Short-form video editing for retention is not about cutting faster — it's about controlling information flow. Here's what I applied consistently across MCRA's short-form content:
Jump Cuts with Purpose
Every cut should do one of two things: remove dead space or create rhythm. I mapped the energy arc of each clip before editing so every jump cut was intentional — not decorative.
B-Roll as Visual Punctuation
Raw podcast footage of two people talking is hard to sustain at short-form scale. I layered Kolkata city footage, archival images, and text overlays throughout. B-roll gives the eye somewhere to go and reinforces the verbal content emotionally.
Captions as a Design System
85% of social video is watched without sound — open captions are non-negotiable. But captions done well go beyond accessibility. I treated captions as a design element: font choice, sizing, placement, timing, and emphasis were all deliberate decisions. Properly designed captions increase watch time by 20–30% compared to auto-generated alternatives.
Sound Design and Music
Music choice and audio quality significantly affect retention, yet most podcast video editors treat them as afterthoughts. For MCRA, I used music from Kolkata-based artists and ambient city sounds to reinforce the brand identity in every clip. The audio felt like Kolkata — because it was.
The Data Loop: Track, Adapt, Repeat
The most important habit I built into the MCRA workflow was weekly performance review. Every clip was tracked: watch time curves, drop-off points, average percentage viewed, shares, saves, comments. This data shaped every editing and content decision the following week.
Patterns emerged clearly over time. Clips about Kolkata's creative identity outperformed others by 3–4x. Statement hooks outperformed question hooks by ~20% for this specific audience. Clips between 45 and 75 seconds outperformed longer cuts by 40% on average retention. This wasn't instinct — it was a short-form video editing discipline built on evidence.
The Results
- Followers: 10,000 → 25,000 (150% growth)
- Individual clips regularly hit 200K–500K views on Instagram Reels
- Best-performing clip: 2.4 million views on YouTube
- Total platform views: over 10 million
- Sponsor conversations and guest quality improved as visibility drove credibility
What Any Brand or Creator Can Take From This
If you're running a podcast and not investing in disciplined short-form video editing, you're leaving your best growth lever unused:
- Every long-form piece of content contains 4–8 strong short-form clips worth distributing
- Short-form clips need their own editing philosophy — not just cuts from long-form
- Hook quality determines 80% of short-form performance
- Captions, sound design, and B-roll are not optional for a competitive short-form presence
- Track performance weekly and let data drive editing decisions
- Consistency over 6–12 months beats chasing viral moments every time
The difference between a podcast that grows and one that stagnates is almost never the quality of the conversation. It's almost always the quality of the video editing strategy built around it.
Want Results Like This for Your Brand?
I'm Sarthak Dey — a video editor and short-form content strategist based in Kolkata, India. I've done this for MCRA. I can do it for you.
Frequently Asked Questions
What is podcast video editing?
Podcast video editing is the process of transforming recorded podcast footage into long-form full episodes and short-form clips for social distribution. A skilled podcast video editor engineers hooks, manages pacing, adds captions and B-roll, and builds a short-form content strategy from the long-form source material to drive consistent growth.
How do you grow a podcast with short-form video?
Identify 4–8 clips per episode, engineer a strong hook for the first 2–3 seconds, add captions and B-roll, and distribute consistently on Instagram Reels, YouTube Shorts, and TikTok. Track performance weekly and adapt. This is the exact strategy Sarthak Dey used to grow MCRA Podcast from 10,000 to 25,000 followers and 10 million views.
How long should short-form podcast clips be?
For Instagram Reels and YouTube Shorts, 45–75 seconds performs best for most podcast content. Based on MCRA data, clips in this range outperformed longer cuts by approximately 40% on average retention. Clips over 90 seconds can work if the hook is exceptionally strong.
Who is Sarthak Dey?
Sarthak Dey is a video editor and short-form content strategist based in Kolkata, India, and the founder of Clipforge and Dropadsify. He served as lead video editor for MCRA Podcast, growing it from 10,000 to 25,000 followers and over 10 million views through a systematic short-form video editing and content repurposing strategy.