Taming Stable Diffusion for Lip Sync!
-
Updated
Jun 20, 2025 - Python
Taming Stable Diffusion for Lip Sync!
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Once the Adobe Sora API can be use, this repository will be updated soon. My other projects: https://videosora.app https://makeimage.ai
⚡ KILLA AI — Multimodal AI chat: text, image gen, video gen, DeepSearch & Knowledge Studio. React + TypeScript + Vite + Node.js. Open-source & Early Access.
Create a Waveform Video (usable on Youtube, Tiktok, etc.) from a WAV or MP3 file. Two output options: ultrafast generation (static background with optional title) and standard generation (dynamic background).
🛠️ Synchronize X-UI subscriptions and tunnel inbounds with SQLite, managing traffic, expiry, and user limits automatically every 30 seconds.
Add a description, image, and links to the video-gen topic page so that developers can more easily learn about it.
To associate your repository with the video-gen topic, visit your repo's landing page and select "manage topics."