Introduction to the Video Podcast Feature
Feb 19, 2025

Looking for a fast and engaging way to turn your audio podcasts into dynamic visual content? Discover VisionStory’s Video Podcast feature! Effortlessly transform any two-person audio conversation into an immersive video podcast—complete with AI-powered scene generation, customizable characters, intelligent shot selection, and more. Here’s how it works:
1. Upload or Import Your Audio
Begin by uploading an audio file (such as .mp3 or .wav) or pasting a link from YouTube, TikTok, or other supported platforms. Once your audio is uploaded, you can preview and trim it to highlight the best parts of your conversation—all within VisionStory’s user-friendly interface.

2. Choose a Scene and Characters
Select a scene to set the mood for your podcast—whether it’s a cozy studio, a modern office, or a virtual news desk. Then, pick two speaker characters from your previously uploaded images, or add new ones for a fresh look.

3. AI-Generated Storyboard
After uploading your audio and selecting your characters, VisionStory’s AI takes over with smart segmenting and automatic shot selection:
- Audio segmentation: The system analyzes the conversation, detecting when each speaker is talking.
- Automatic shot selection: Each audio segment is paired with the most suitable shot type:
  - Single-person close-up to highlight a speaker’s expression
  - Single-person mid-shot for a balanced view
  - Two-person shot when both speakers interact
These storyboards are created automatically—ideal for anyone seeking professional results without advanced editing skills.
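To make the idea concrete, here is a minimal Python sketch of how speaker-labeled segments could be mapped to the three shot types above. It is not VisionStory's actual logic: the `Segment` type, the `choose_shot` heuristic, and the 4-second close-up threshold are all assumptions made purely for illustration.

```python
from dataclasses import dataclass

# Hypothetical labels mirroring the three shot types listed above.
CLOSE_UP = "close-up"
MID_SHOT = "mid-shot"
TWO_PERSON = "two-person"


@dataclass(frozen=True)
class Segment:
    start: float          # segment start, in seconds
    end: float            # segment end, in seconds
    speakers: frozenset   # labels of the speakers active in this segment


def choose_shot(segment, close_up_max_seconds=4.0):
    """Pick a shot type for one segment (illustrative heuristic only)."""
    if len(segment.speakers) > 1:
        # Both hosts are active, so show them together.
        return TWO_PERSON
    if segment.end - segment.start <= close_up_max_seconds:
        # Assumed rule: short remarks get a close-up.
        return CLOSE_UP
    # Assumed rule: longer single-speaker stretches get a mid-shot.
    return MID_SHOT


def build_storyboard(segments):
    """Return (start, end, shot type) for every segment, in order."""
    return [(s.start, s.end, choose_shot(s)) for s in segments]


segments = [
    Segment(0.0, 3.0, frozenset({"A"})),
    Segment(3.0, 12.0, frozenset({"B"})),
    Segment(12.0, 15.0, frozenset({"A", "B"})),
]
storyboard = build_storyboard(segments)
for start, end, shot in storyboard:
    print(f"{start:5.1f}s to {end:5.1f}s: {shot}")
```

In the product, you can override any automatically chosen shot in the storyboard editor, which is why a simple default is acceptable in a sketch like this.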

4. Fine-Tune Scenes and Voices
Within the storyboard editor, you can customize each shot to your preference:
- Switch shot types: Change from close-up to mid-shot, or use a two-person shot for both hosts.
- Change voices: Select an alternative AI voice for each host to match your desired tone or style.
- Swap characters: Instantly switch which character appears in each segment for optimal visual flow.

5. One-Click Aspect Ratio Switching
Creating content for multiple platforms? Easily toggle between 16:9 (landscape) and 9:16 (vertical) formats. The scene, characters, and shots all automatically adjust to the new aspect ratio—ensuring your video looks polished on every platform.
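If you are curious what the two formats mean in pixels, the short Python sketch below works out the frame size for each ratio. The `frame_size` helper and the 1080-pixel short side are assumptions chosen for illustration, not VisionStory's documented export settings.

```python
def frame_size(aspect_ratio, short_side=1080):
    """Return (width, height) in pixels for a ratio string such as "16:9".

    short_side is an assumed resolution for the shorter edge of the frame.
    """
    w, h = (int(part) for part in aspect_ratio.split(":"))
    if w >= h:
        # Landscape (or square): the height is the short side.
        return (short_side * w // h, short_side)
    # Vertical: the width is the short side.
    return (short_side, short_side * h // w)


landscape = frame_size("16:9")
vertical = frame_size("9:16")
print(landscape, vertical)
```

The two results are the same frame rotated by 90 degrees, so characters and shots have to be re-framed rather than simply scaled when you switch formats.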

6. Generate Your Final Video
Once you’re happy with your storyboard and settings, simply click Generate to produce your complete video podcast. VisionStory’s fast rendering engine brings together your background scene, characters, audio, and camera transitions. In just moments, your immersive, AI-powered video podcast will be ready to engage your audience!
Preparing Your Podcast Audio & Key Usage Tips
1. Getting Your Audio
- No podcast audio file yet? Use tools like NotebookLM by Google to generate speech audio from text.
- VisionStory will soon offer a similar service, allowing you to create podcasts directly from text on our platform.
2. Speaker Separation Limitations
- Currently, our system can’t perfectly separate overlapping voices. If two hosts speak at the same time, the voice changer feature may not work as intended.
- For best results, use clear audio where only one person speaks at a time.
3. Subscription Requirement
Anyone can upload podcast audio and generate a storyboard with AI-powered speakers, scenes, and shots, but final video podcast generation is available only to subscribers on the Pro Plan or above. If you’re not a member yet, consider subscribing to unlock this feature.
4. Video Length & Credits
- Currently, generated videos are limited to 10 minutes in length, regardless of your subscription tier.
- Monitor your credit usage according to your plan; longer or more complex videos will use additional credits.
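Before uploading, you may want to check whether your audio fits within the 10-minute limit. The Python sketch below does this for .wav files using only the standard library. It assumes the finished video is about as long as the audio, and the function names are hypothetical, not part of any VisionStory tool.

```python
import wave

MAX_VIDEO_SECONDS = 10 * 60  # the 10-minute limit mentioned above


def wav_duration_seconds(path):
    """Return the length of a .wav file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def fits_video_limit(duration_seconds, limit_seconds=MAX_VIDEO_SECONDS):
    """True if audio of this length stays within the video length limit."""
    return duration_seconds <= limit_seconds


# Example usage with your own file (the path is a placeholder):
# duration = wav_duration_seconds("my_podcast.wav")
# if not fits_video_limit(duration):
#     print(f"Trim about {duration - MAX_VIDEO_SECONDS:.0f} seconds before generating.")
```

If the audio is too long, you can trim it in the upload step described earlier.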
Why Choose VisionStory’s Video Podcast Feature?
1. Versatile Use Cases
- Content Creators: Effortlessly add a visual element to interviews or co-hosted shows.
- Marketing Teams: Promote products or host discussions that engage audiences on social media.
- Educators & Trainers: Create engaging lesson recaps or remote webinars with a personable touch.
2. AI-Powered Editing
Save hours of manual editing and shot selection. VisionStory’s algorithms handle the technical details for you.
3. Highly Customizable
From backgrounds to voices and aspect ratios, you have full control over the final look and feel.
4. Professional Quality, Minimal Effort
Produce polished, dynamic video content without advanced editing skills or a full production team.
Transform your two-person conversations into immersive video podcasts in just a few steps. With VisionStory’s AI-driven technology, creating professional, visually engaging podcast episodes has never been easier for Canadian creators!