X-Dub A context-rich visual dubbing. Video-to-video editing — @TokenDiff

@TokenDiff443 подп.

41просмотров

9.3%от подписчиков

28 марта 2026 г.

🎬 ВидеоScore: 45

X-Dub A context-rich visual dubbing. Video-to-video editing approach to synchronize lip movements with new audio while preserving identity and handling occlusions. 🔴Noise-level specialized LoRA experts for structure, lip-sync, and texture refinement. 🔴1B-parameter DiT + Whisper audio encoding. 🔴 96.36% success rate on complex dataset 👉 Project 👉 GitHub 👉 Model

просмотров

368

символов

Нет

эмодзи

Да

медиа

Другие посты @TokenDiff

LTX2.3 I2V/T2V ID-LoRA Unlike using custom spoken audio input that strips the ambient sound away,👁 53 RealRestorer A new king of open-source image rescue. An all-in-one DiT (Step1X-Edit) that handles👁 53 MACRO Fixes multi-reference image generation by providing a large-scale dataset and benchmark desig👁 50 ComfyUI-DaVinci-MagiHuman 🔴Block-level CPU/GPU swapping 🔴Async CUDA prefetching 🔴Distill mode 🔴108👁 50 Cohere Transcribe Local ASR just got a massive efficiency upgrade. Hit #1 on the ASR leaderboard (5👁 49

Все посты канала →

Аналитика канала База постов