Best practices
Lip sync quality depends on the source footage, the audio, and the tier you choose. These are the highest-impact improvements you can make.Video best practices
- prefer front-facing or near-front-facing shots
- keep the mouth and lower face visible
- avoid heavy obstructions such as hands, microphones, or hair over the lips
- use stable footage when possible
- keep lighting even across the face
- prefer one clearly visible speaker when possible
Audio best practices
- use clean speech with minimal background noise
- avoid overlapping speakers
- isolate vocals when possible
- make sure the translated audio already sounds natural before enabling lip sync
Media recommendations
- prefer MP4 or MOV for video
- prefer WAV or MP3 for audio
- 1080p is usually the best balance between quality and speed
- videos above 4K are not a good fit for this workflow
When quality issues appear
If the result looks weak, check these first:- the face angle may be too extreme
- the mouth may be partially hidden
- the source footage may be too shaky
- the audio may contain noise or multiple speakers
- the selected tier may not match the quality target
Choose the right tier
- use
Lip Syncfor speed and general-purpose output - use
Lip Sync Prowhen you want more detail - use
Lip Sync Studiowhen you are optimizing for the best short premium result

