How to Create a Talking Video with VisionStory

Dec 15, 2024

Illustration of the talking video creation process

1. Upload or Select a Character

Upload a front-facing photo with clearly visible shoulders for optimal lip synchronisation and facial tracking. If you have already uploaded images or wish to try VisionStory’s sample library, you can also select an existing character from your collection.

Uploading a character image for video creation

2. Add Your Script or Audio

Next, decide what your character will say or express:

  • Type Text: Enter your chosen dialogue directly into the text field.
  • Import/Record Audio: Upload a pre-recorded audio file or record new audio instantly.
  • URL Import: Paste a link (e.g., from YouTube or TikTok) to use audio from external sources.

3. Select or Clone a Voice

Once your script or audio is ready, choose the ideal voice:

  • Choose from 200+ AI Voices: Access the Voice Library, filtering by language, gender, age, and style. Click the play icon to preview each voice.
  • Clone a Voice (Pro Plan or above): For a bespoke voice, upload or record sample audio. VisionStory will create an AI-powered replica for use in your projects.
Selecting or cloning a voice for the character

4. Configure Video Settings

Before generating your video, customise the visuals and output:

  • Quality:
    • Standard (no extra credits required)
    • HD (Pro Plan or above, additional credits may apply)
  • Aspect Ratio: 9:16 (portrait), 16:9 (landscape), or 1:1 (square), depending on your intended platform.
  • Facial Expressions: Use the “Emotion” selector to adjust on-screen facial cues (e.g., cheerful, marketing, news). This controls the character’s expression, not the audio’s tone.
  • Green Screen (Pro Plan or above): Enable a solid green background if you wish to composite your character into other scenes later.
Configuring video settings for talking videos

5. Generate Your Talking Video

When you are ready:

  • Preview Audio: Ensure the voice and pacing meet your expectations.
  • Check Credit Usage: Each 15 seconds of video uses 1 credit; HD and green screen options require additional credits.
  • Click “Generate Talking Video”: VisionStory will animate your character, synchronising lip movements with your chosen script or audio.

6. Final Preview & Sharing

Once processing is complete, your video will appear in the Assets section. From here, you can:

  • Preview or Play the final video to review the result.
  • Rename the video title to keep your library organised.
  • Provide Feedback if you are not satisfied or wish to suggest improvements.
  • Share to X or Facebook directly for quick social posting.
  • Copy Link to share your video on other platforms.
  • Download the MP4 file for local storage or embedding on websites.
  • Delete the video if you no longer require it.

By following these steps, you can create engaging videos where your virtual character speaks with realistic lip-sync and expressive animation. With VisionStory, anyone can produce a captivating on-screen presence in just minutes!