Pyramid Flow uses a multi-resolution pyramid + flow matching to generate 10-second 768p clips on a single 24 GB GPU — territory that previously required datacenter hardware. Quality is below Mochi / Wan but the VRAM efficiency is genuinely novel. MIT licensed.
Pros & cons
Pros
✓10-second 768p clips on a single 24 GB card
✓Pyramidal training is a genuinely interesting architecture
✓MIT — most permissive video model license available