基于 FFmpeg + Qt 的多线程媒体播放器 + HTTP-MPEG-TS 低延迟推流器,实现音视频同步、音频混合、seek 续流与浏览器实时播放
-
Updated
Jun 16, 2026 - C++
基于 FFmpeg + Qt 的多线程媒体播放器 + HTTP-MPEG-TS 低延迟推流器,实现音视频同步、音频混合、seek 续流与浏览器实时播放
HighSync: High-Quality Lip Synchronization via Latent Diffusion Models
Production-ready Multimodal Lip Sync Detection & Deepfake Detection System. Detects audio-video synchronization mismatches using deep learning (PyTorch) with a scalable FastAPI-based inference pipeline. Optimized for real-time processing,low false positives, and robust performance on noisy speech segments. Built for video forensics,synthetic media
Multimodal pipeline for turning PPTs or documents into dubbed videos with voice cloning, timeline alignment, and retimed video composition.
Add a description, image, and links to the audio-video-sync topic page so that developers can more easily learn about it.
To associate your repository with the audio-video-sync topic, visit your repo's landing page and select "manage topics."