Best practices

Lip sync quality depends on the source footage, the audio, and the tier you choose. These are the highest-impact improvements you can make.

Video best practices

prefer front-facing or near-front-facing shots
keep the mouth and lower face visible
avoid heavy obstructions such as hands, microphones, or hair over the lips
use stable footage when possible
keep lighting even across the face
prefer one clearly visible speaker when possible

Audio best practices

use clean speech with minimal background noise
avoid overlapping speakers
isolate vocals when possible
make sure the translated audio already sounds natural before enabling lip sync

Media recommendations

prefer MP4 or MOV for video
prefer WAV or MP3 for audio
1080p is usually the best balance between quality and speed
videos above 4K are not a good fit for this workflow

When quality issues appear

If the result looks weak, check these first:

the face angle may be too extreme
the mouth may be partially hidden
the source footage may be too shaky
the audio may contain noise or multiple speakers
the selected tier may not match the quality target

Choose the right tier

use Lip Sync for speed and general-purpose output
use Lip Sync Pro when you want more detail
use Lip Sync Studio when you are optimizing for the best short premium result

Use cases Tools overview