Wan 2.2
Open-weight video diffusion from Alibaba.
- Min VRAM: 12 GB
- GPU class: Workstation GPU
- Quant: FP16
Twelve gigabytes is the modern hobbyist sweet spot. Flux runs comfortably at FP8, 13B LLMs fit at Q5/Q6, and the smallest video diffusion models become viable. The RTX 3060 12 GB remains the best price-per-VRAM card ever made.
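The "13B LLMs fit at Q5/Q6" claim can be sanity-checked with back-of-the-envelope arithmetic. The sketch below is only an estimate under stated assumptions: the bits-per-weight figures (about 5.5 for Q5 and about 6.56 for Q6) are approximate averages for common GGUF-style quantizations, and the 1.5 GiB headroom for KV cache and runtime overhead is a made-up illustrative number, not a measured one. The function names are hypothetical.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float,
         vram_gib: float = 12.0, headroom_gib: float = 1.5) -> bool:
    """True if weights plus an assumed headroom fit in the VRAM budget.

    headroom_gib is an illustrative guess covering KV cache, activations,
    and driver overhead; real usage varies with context length and backend.
    """
    return weight_gib(params_billion, bits_per_weight) + headroom_gib <= vram_gib


if __name__ == "__main__":
    # Assumed average bits per weight; actual values depend on the quant scheme.
    for label, bits in [("Q5", 5.5), ("Q6", 6.56), ("FP16", 16.0)]:
        size = weight_gib(13, bits)
        print(f"13B @ {label}: ~{size:.1f} GiB weights, fits in 12 GB: {fits(13, bits)}")
```

Under these assumptions the Q5 and Q6 variants of a 13B model land below the 12 GB budget with some room to spare, while the same model at FP16 needs roughly twice the available memory.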
Sorted by best fit for this tier: tools designed around your VRAM budget come first, and within that, tools are ordered by our power-user score.
Open-weight video diffusion from Alibaba.
The standard SDXL/Flux LoRA training UI.
The INRIA original — train your own splats.
Tencent's open 3D generator — multi-view, PBR, ready-to-use meshes.
Real-time-ish open video diffusion from Lightricks.
Modern alternative trainer for SD/SDXL/Flux.
Stability's MMDiT flagship at 8B params.
Diffusion-based photorealistic upscaler.
Open-source text-to-video diffusion from THUDM.
Dead-simple Flux LoRA training in a Gradio UI.
Image-to-video diffusion — 25 frames, 14 or 25 steps.
Animation motion modules for ComfyUI.