Introduction to the Video Podcast Feature
Feb 19, 2025

Looking for a fast and engaging way to turn your audio podcasts into visually rich experiences? Discover VisionStory’s Video Podcast feature! Effortlessly transform any two-person audio conversation into an immersive video podcast—complete with AI-powered scene generation, customisable characters, intelligent shot selection, and more. Here’s how it works for Indian content creators and teams:
1. Upload or Import Your Audio
Begin by uploading an audio file (such as .mp3 or .wav) or simply paste a link from YouTube, TikTok, or other supported platforms. Once your audio is uploaded, you can preview and trim it to highlight the best parts of your conversation—all within VisionStory’s user-friendly interface.

2. Select a Scene and Characters
Next, choose a scene to set the mood for your podcast—whether it’s a cosy studio, a modern office, or a virtual news desk. Then, select two speaker characters from your previously uploaded images, or add new ones. You can even use animal images for unique avatars!

3. AI-Generated Storyboard
After uploading your audio and choosing your characters, VisionStory’s AI takes over with smart segmenting and automatic shot selection:
- Audio segmentation: The system analyses the conversation, detecting when each speaker is talking.
- Automatic shot selection: Each audio segment is matched with the most suitable shot type:
  - Single-person close-up to highlight a speaker’s expression
  - Single-person mid-shot for a balanced view
  - Two-person shot when both speakers interact
These storyboards are created automatically—ideal for anyone seeking professional results without advanced video editing skills.
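
To make the idea concrete, here is a minimal Python sketch of how speaker segments could be mapped to the three shot types listed above. It is an illustration only: VisionStory does not publish its selection logic, so the `Segment` class, the `pick_shot` rule, and the duration thresholds are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One stretch of audio spoken by a single speaker (hypothetical structure)."""
    speaker: str
    start: float  # seconds
    end: float    # seconds


def pick_shot(segment, previous=None, quick_exchange_s=2.0, long_turn_s=8.0):
    """Pick a shot type with a simple, assumed heuristic (not VisionStory's rule)."""
    duration = segment.end - segment.start
    speaker_changed = previous is not None and previous.speaker != segment.speaker
    # A short reply right after the other speaker suggests back-and-forth interaction.
    if speaker_changed and duration < quick_exchange_s:
        return "two-person"
    # A long uninterrupted turn gets a close-up to highlight expression.
    if duration >= long_turn_s:
        return "close-up"
    # Everything else falls back to the balanced mid-shot.
    return "mid-shot"


def build_storyboard(segments):
    """Return (speaker, start, end, shot) tuples, one per segment."""
    storyboard = []
    previous = None
    for segment in segments:
        shot = pick_shot(segment, previous)
        storyboard.append((segment.speaker, segment.start, segment.end, shot))
        previous = segment
    return storyboard
```

For example, a long opening turn by one host, a quick reply from the other, and a medium-length follow-up would come out as a close-up, a two-person shot, and a mid-shot under these assumed thresholds.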

4. Fine-Tune Your Scenes and Voices
Within the storyboard editor, you can customise each shot to suit your preferences:
- Switch shot types: Move between close-up, mid-shot, or two-person shots for the best visual flow.
- Change voices: Select alternative AI voices for each host to match your desired tone or language, including Indian languages like Hindi, Tamil, Telugu, and more.
- Swap characters: Instantly change which character appears in each segment for optimal storytelling.

5. One-Click Aspect Ratio Switching
Creating content for multiple platforms? Easily switch between 16:9 (landscape) and 9:16 (vertical) formats with a single click. All scenes, characters, and shots automatically adjust to the new aspect ratio, so your video podcast looks professional on YouTube, Instagram Reels, and other social platforms popular in India.
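
As a rough illustration of what the two formats mean in pixels, the Python sketch below derives frame dimensions from an aspect ratio. The `frame_size` helper and the 1080-pixel short side are assumptions made for this example; the post does not state VisionStory's actual output resolutions.

```python
from fractions import Fraction

# The two formats named above, expressed as width / height.
ASPECT_RATIOS = {
    "landscape": Fraction(16, 9),
    "vertical": Fraction(9, 16),
}


def frame_size(orientation, short_side=1080):
    """Return (width, height) in pixels for the given orientation.

    short_side is an assumed value for illustration, not a documented setting.
    """
    ratio = ASPECT_RATIOS[orientation]
    if ratio >= 1:
        # Landscape: the height is the short side.
        return int(short_side * ratio), short_side
    # Vertical: the width is the short side.
    return short_side, int(short_side / ratio)
```

With the assumed 1080-pixel short side, `frame_size("landscape")` gives a 1920 by 1080 frame and `frame_size("vertical")` gives 1080 by 1920, which is why the same scene has to be reframed rather than simply rotated.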

6. Generate Your Final Video
Once you’re happy with your storyboard and settings, simply click Generate to produce your complete video podcast. VisionStory’s fast rendering engine brings together your background, characters, audio, and camera transitions. In just a few moments, your AI-powered video podcast is ready to engage your audience!
Preparing Your Podcast Audio & Key Usage Tips
1. Getting Your Audio
- No ready-made podcast file? Use tools like NotebookLM by Google to generate speech audio from text.
- VisionStory will soon offer a similar service, allowing you to create podcasts directly from text within the platform.
2. Speaker Separation Limitations
- Currently, the system cannot perfectly separate overlapping voices. If two speakers talk at the same time, switching to an alternative AI voice may not work as expected for that part of the audio.
- For best results, use clear audio where only one person speaks at a time.
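
If you already have rough timestamps for each speaker's turns (from your own notes or any transcription tool), a short Python check like the one below can flag stretches where both people talk at once before you upload. This is a sketch under that assumption; the `find_overlaps` function and the tuple format are hypothetical and not part of VisionStory.

```python
def find_overlaps(turns):
    """Find time ranges where two different speakers talk at the same time.

    turns: list of (speaker, start_seconds, end_seconds) tuples.
    Returns a list of (overlap_start, overlap_end) tuples.
    """
    ordered = sorted(turns, key=lambda turn: turn[1])
    overlaps = []
    for index, (speaker_a, _start_a, end_a) in enumerate(ordered):
        for speaker_b, start_b, end_b in ordered[index + 1:]:
            # Turns are sorted by start time, so later turns cannot overlap either.
            if start_b >= end_a:
                break
            if speaker_a != speaker_b:
                overlaps.append((start_b, min(end_a, end_b)))
    return overlaps
```

Any range this returns is a candidate for trimming or re-recording so that only one person speaks at a time.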
3. Subscription Requirement
All users can upload podcast audio and generate a storyboard with AI-powered speakers, scenes, and shots. However, generating the final podcast video is available only to subscribers on the Pro Plan or above. If you’re not a subscriber yet, consider upgrading to unlock this feature.
4. Video Length & Credits
- Currently, generated video podcasts are limited to 10 minutes in length, regardless of your subscription tier.
- Keep an eye on the credits included in your plan; longer or more complex videos consume more credits.
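
To check a recording against the 10-minute limit before uploading, a small Python sketch using the standard-library `wave` module is shown below. It only handles uncompressed .wav files, the helper names are hypothetical, and it assumes the limit is inclusive, which the post does not specify.

```python
import wave

MAX_SECONDS = 10 * 60  # the 10-minute limit stated above


def wav_duration_seconds(path):
    """Return the duration of an uncompressed .wav file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def fits_length_limit(duration_seconds, limit_seconds=MAX_SECONDS):
    """True if the audio is within the limit (assumed inclusive)."""
    return duration_seconds <= limit_seconds
```

If `fits_length_limit(wav_duration_seconds("episode.wav"))` returns `False` (where `episode.wav` stands in for your own file), trim the audio or split it into parts before generating the video.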
Why Choose VisionStory’s Video Podcast Feature?
1. Versatile Use Cases
- Content Creators: Add a visual dimension to interviews or co-hosted shows for Indian audiences.
- Marketing Teams: Promote products or host discussions that engage viewers on Indian social media platforms.
- Educators & Trainers: Create interactive lesson recaps or online webinars with a personal touch.
2. AI-Powered Editing
Save hours of manual editing and shot selection. VisionStory’s advanced algorithms handle the technical work for you.
3. Highly Customisable
From choosing backgrounds to refining voices and aspect ratios, you have full control over the final output.
4. Professional Quality, Minimal Effort
Produce polished, dynamic video content without advanced editing skills or a large production team.
Transform your two-person conversations into immersive video podcasts in just a few steps. With VisionStory’s AI-driven technology, creating professional, visually engaging podcast episodes for Indian audiences has never been easier!