Wan 2.2
Open-weight video diffusion from Alibaba.
Open Source 12–48 GB VRAM
- Min VRAM
- 12 GB
- GPU class
- Workstation GPU
- Quant
- FP16
Actually FreeNo SignupOpen SourceWatermark-Free
Image-to-video diffusion — 25 frames, 14 or 25 steps.
Runs locally · High-end GPU (16–24 GB)
SVD-XT 25-frame needs ~16 GB; 12 GB viable with lower batch.
Stability AI's open-weight image-to-video model. Feed it a still image, get back a 25-frame clip with plausible camera motion and scene dynamics. Two variants: SVD (14 frames) and SVD-XT (25 frames). Image-conditioning only — no text prompt control over motion.
Stability AI Community License (free for non-commercial / small revenue).
Open-weight video diffusion from Alibaba.
13B open-weight cinematic text-to-video.
Real-time-ish open video diffusion from Lightricks.
Open-source text-to-video diffusion from THUDM.