Audio Conditioned LipSync with Latent Diffusion Models
Generate lip-synced video for audio and reference video
Create talking face animations from still images and audio