Skip to content
AI Tools Finder

Pyramid Flow

Memory-efficient T2V via pyramidal flow matching.

Open Source 16–24 GB VRAMRuns locally
Actually FreeNo SignupOpen SourceWatermark-Free
Visit Pyramid FlowUpdated 2026-02-20 · Direct link

Hardware requirements

Runs locally · High-end GPU (16–24 GB)

16–24 GB VRAM
Min VRAM
16 GB
Rec. VRAM
24 GB
Min RAM
32 GB
Rec. RAM
64 GB
Disk
30 GB
GPU class
High-end GPU
12.1+No Apple SiliconGPU RequiredQuant: BF16, FP8

384p variant runs in 16 GB; 768p needs 24 GB.

Screenshot placeholder · Pyramid Flow

What is Pyramid Flow?

Pyramid Flow uses a multi-resolution pyramid + flow matching to generate 10-second 768p clips on a single 24 GB GPU — territory that previously required datacenter hardware. Quality is below Mochi / Wan but the VRAM efficiency is genuinely novel. MIT licensed.

Pros & cons

Pros

  • 10-second 768p clips on a single 24 GB card
  • Pyramidal training is a genuinely interesting architecture
  • MIT — most permissive video model license available

Cons

  • Motion / detail quality below Mochi & Wan
  • Smaller community than Hunyuan / Wan workflows

What's actually free?

MIT.

✓ Actually FreeNo SignupOpen SourceWatermark-Free

Alternatives

Mochi 1

Genmo's 10-B open-weight T2V — the first 'genuinely fluid' OSS video model.

Open Source 24–60 GB VRAM
Min VRAM
24 GB
GPU class
Workstation GPU
Quant
BF16
Actually FreeNo SignupOpen SourceWatermark-Free

CogVideoX 5B

Open-source text-to-video diffusion from THUDM.

Open Source 12–24 GB VRAM
Min VRAM
12 GB
GPU class
High-end GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

Wan 2.2

Open-weight video diffusion from Alibaba.

Open Source 12–48 GB VRAM
Min VRAM
12 GB
GPU class
Workstation GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free