How to Create a Talking Video with VisionStory

Dec 15, 2024

Illustration of the talking video creation process

1. Upload or Select a Character

Upload a front-facing photo with clearly visible shoulders and no obstructions for the best lip-sync and facial tracking. If you have already uploaded images or wish to try VisionStory’s sample library, you can also select an existing character from your collection.

Uploading a character image for video creation

2. Add Your Script or Audio

Next, decide what you want your character to say or express:

  • Type Text: Enter your desired dialogue directly in the text box.
  • Import/Record Audio: Upload a pre-recorded audio file or record new audio instantly.
  • URL Import: Paste a link (such as from YouTube or TikTok) to use audio from external sources.

3. Select or Clone a Voice

Once your script or audio is ready, choose the ideal voice:

  • Choose from 200+ AI Voices: Access the Voice Library and filter by language, gender, age, or style. Click the play icon to preview each voice.
  • Clone a Voice (Pro Plan or above): For a personalised voice, upload or record sample audio. VisionStory will create an AI-based replica for use in your projects.
Selecting or cloning a voice for the character

4. Configure Video Settings

Before generating your video, customise the visuals and output:

  • Quality:
    • Standard (no extra credits required)
    • HD (Pro Plan or above, additional credits apply)
  • Aspect Ratio: Choose from 9:16 (portrait), 16:9 (landscape), or 1:1 (square) based on your sharing platform.
  • Facial Expressions: Use the “Emotion” selector to set on-screen facial cues (e.g., cheerful, marketing, news). This affects the character’s expression, not the voice tone.
  • Green Screen (Pro Plan or above): Enable a solid green background for easy editing or compositing in other scenes.
Configuring video settings for talking videos

5. Generate Your Talking Video

When you are ready:

  • Preview Audio: Ensure the voice and pacing meet your expectations.
  • Check Credit Usage: Each 15 seconds of video uses 1 credit; HD and green screen features require extra credits.
  • Click “Generate Talking Video”: VisionStory will animate your character, synchronising lip movements with your script or audio.

6. Final Preview & Sharing

After processing, your video will appear in the Assets section. Here, you can:

  • Preview or Play the final video to review the result.
  • Rename the video title to keep your library organised.
  • Provide Feedback if you are not satisfied or have suggestions for improvement.
  • Share to X (formerly Twitter) or Facebook for instant social sharing.
  • Copy Link to share your video on other platforms.
  • Download the MP4 file for offline use or embedding on websites.
  • Delete the video if you no longer need it.

By following these steps, you can easily create engaging videos where your virtual character speaks with realistic lip-sync and expressive animations. With VisionStory, anyone in India can produce captivating, professional-quality talking videos in just a few minutes!