This guide helps you run Wav2Lip locally on a Windows machine without a GPU, using Conda.
- Windows 10/11
- Miniconda / Anaconda
- FFmpeg (installed and added to PATH)
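Before continuing, it can help to confirm that the required tools are reachable from the command line. The following is a minimal sketch using only the Python standard library; the `missing_tools` helper and its default tool list are illustrative and not part of Wav2Lip. Note that `conda` may only be on PATH inside an Anaconda/Miniconda prompt.

```python
import shutil


def missing_tools(tools=("ffmpeg", "conda", "git")):
    """Return the names from `tools` that cannot be found on PATH.

    `missing_tools` is a hypothetical helper for this guide, not a Wav2Lip API.
    """
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```

If `ffmpeg` is reported as missing, re-check that its `bin` folder was added to PATH and open a new terminal.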
git clone https://github.com/Rudrabha/Wav2Lip.git
cd Wav2Lip
conda env create -f environment.yml
conda activate wav2lip
👉 No need to install requirements.txt
Download the model file and place it here:
Wav2Lip/checkpoints/wav2lip.pth
OR (for low-end systems):
Wav2Lip/checkpoints/Wav2Lip-SD-NOGAN.pt
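To verify that a checkpoint ended up in the right folder, a small check like the one below may be useful. It is a sketch under the assumption that you run it from inside the `Wav2Lip` folder; `find_checkpoint` is a hypothetical helper, and the two file names are simply the ones listed above.

```python
from pathlib import Path

# File names taken from this guide, in order of preference.
CANDIDATES = ("wav2lip.pth", "Wav2Lip-SD-NOGAN.pt")


def find_checkpoint(checkpoint_dir="checkpoints", candidates=CANDIDATES):
    """Return the first candidate file found in `checkpoint_dir`, or None."""
    folder = Path(checkpoint_dir)
    for name in candidates:
        path = folder / name
        if path.is_file():
            return path
    return None


if __name__ == "__main__":
    found = find_checkpoint()
    if found is None:
        print("No checkpoint found in the checkpoints folder.")
    else:
        print("Using checkpoint:", found)
```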
python inference.py --checkpoint_path checkpoints\wav2lip.pth --face assets\face.jpg --audio assets\test.wav --outfile output.mp4 --resize_factor 3 --nosmooth
Sometimes the audio is not merged into the output video automatically.
If that happens, merge it manually:
ffmpeg -y -i temp\result.avi -i assets\test.wav -c:v copy -c:a aac output_final.mp4
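The same merge can be scripted if you run it often. This is a rough sketch that wraps the exact FFmpeg arguments shown above with `subprocess`; the function names are illustrative, and the default paths assume the same files as the command above (forward slashes are used here, which FFmpeg also accepts on Windows).

```python
import subprocess


def build_merge_command(video="temp/result.avi",
                        audio="assets/test.wav",
                        output="output_final.mp4"):
    """Build the FFmpeg argument list: copy the video stream, encode audio as AAC."""
    return [
        "ffmpeg", "-y",      # overwrite the output file without asking
        "-i", video,         # silent video produced by inference
        "-i", audio,         # original audio track
        "-c:v", "copy",      # keep the video stream as-is (no re-encode)
        "-c:a", "aac",       # encode the audio as AAC
        output,
    ]


def merge_audio(video="temp/result.avi",
                audio="assets/test.wav",
                output="output_final.mp4"):
    """Run FFmpeg; raises CalledProcessError if FFmpeg exits with an error."""
    subprocess.run(build_merge_command(video, audio, output), check=True)
```

Calling `merge_audio()` from the `Wav2Lip` folder should behave like the manual command, provided FFmpeg is on PATH.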
- CPU execution is slow (5–15 min for a short video)
- Use short clips (≤10 sec)
- Prefer image input over video for better performance
- Lower resolution improves speed
- Use --resize_factor 3 or 4
- Use .jpg instead of video
- Close other apps while running
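To get a feel for why a larger `--resize_factor` speeds things up, the sketch below estimates the frame size after downscaling. It assumes the factor divides width and height by integer division, which is how the option appears to behave but is not confirmed by this guide; `downscaled_size` is a hypothetical helper, not part of Wav2Lip.

```python
def downscaled_size(width, height, resize_factor):
    """Estimate frame size after applying --resize_factor.

    Assumption: both dimensions are integer-divided by the factor.
    """
    if resize_factor < 1:
        raise ValueError("resize_factor must be at least 1")
    return width // resize_factor, height // resize_factor


if __name__ == "__main__":
    # Example with a 1920x1080 input frame.
    for factor in (1, 3, 4):
        w, h = downscaled_size(1920, 1080, factor)
        print(f"resize_factor={factor}: {w}x{h} ({w * h} pixels per frame)")
```

Under this assumption, a factor of 3 leaves roughly one ninth of the pixels to process per frame, and a factor of 4 about one sixteenth.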
Use:
- assets/test.wav (audio)
- assets/face.jpg (image)
pip install tqdm opencv-python librosa
- Ensure FFmpeg is installed
- Merge manually (see above)
- .pth works directly
- .pt requires a small code patch