Skip to main content

Lip Sync

Synchronize video avatar lips with audio input for natural-looking talking head videos.

Try it out

Test the Lip Sync API directly in your browser with our interactive playground.

Overview

The Lip Sync API uses advanced AI to precisely match lip movements with speech audio, creating natural and convincing talking head videos.

API Reference

Endpoint

POST https://api.tryvinci.com/vincils

Headers

NameTypeDescription
X-User-IDstringRequired. Your user ID

Request Body

Multipart form data with:

ParameterTypeDescription
audioFileAudio file containing speech
videoFileVideo file to synchronize

Response

{
url: string, // URL to the synchronized video
media_type: string, // "video/mp4"
user_id: string // The user ID used for the request
}

Code Examples

import requests

url = "https://api.tryvinci.com/vincils"
headers = {"X-User-ID": "your-user-id"}
files = {
"audio": open("speech.wav", "rb"),
"video": open("video.mp4", "rb")
}

response = requests.post(url, headers=headers, files=files)
result = response.json()
print(f"Synchronized Video URL: {result['url']}")

Input Requirements

Video Requirements
  • Format: MP4, MOV
  • Resolution: 720p or higher recommended
  • Clear, front-facing view of the speaker
Audio Requirements
  • Format: WAV, MP3, M4A
  • Clear speech with minimal background noise
  • Matching duration with video
Duration Limits
  • Optimal length: 5-60 seconds per request
  • Longer videos should be split into segments

Best Practices

Video Quality

Use high-quality, well-lit video footage.

Audio Clarity

Ensure clear speech audio for best sync.

Performance

For best results, ensure the speaker's face is clearly visible and well-lit throughout the video.