Skip to main content

Best practices

Lip sync quality depends on the source footage, the audio, and the tier you choose. These are the highest-impact improvements you can make.

Video best practices

  • prefer front-facing or near-front-facing shots
  • keep the mouth and lower face visible
  • avoid heavy obstructions such as hands, microphones, or hair over the lips
  • use stable footage when possible
  • keep lighting even across the face
  • prefer one clearly visible speaker when possible

Audio best practices

  • use clean speech with minimal background noise
  • avoid overlapping speakers
  • isolate vocals when possible
  • make sure the translated audio already sounds natural before enabling lip sync

Media recommendations

  • prefer MP4 or MOV for video
  • prefer WAV or MP3 for audio
  • 1080p is usually the best balance between quality and speed
  • videos above 4K are not a good fit for this workflow

When quality issues appear

If the result looks weak, check these first:
  • the face angle may be too extreme
  • the mouth may be partially hidden
  • the source footage may be too shaky
  • the audio may contain noise or multiple speakers
  • the selected tier may not match the quality target

Choose the right tier

  • use Lip Sync for speed and general-purpose output
  • use Lip Sync Pro when you want more detail
  • use Lip Sync Studio when you are optimizing for the best short premium result