Skip to content
AI Tools Finder

Mochi 1

Genmo's 10-B open-weight T2V — the first 'genuinely fluid' OSS video model.

Open Source 24–60 GB VRAMRuns locally
Actually FreeNo SignupOpen SourceWatermark-FreeAPI
Visit Mochi 1Updated 2026-03-10 · Direct link

Hardware requirements

Runs locally · Workstation GPU (32–48 GB)

24–60 GB VRAM
Min VRAM
24 GB
Rec. VRAM
60 GB
Min RAM
32 GB
Rec. RAM
64 GB
Disk
40 GB
GPU class
Workstation GPU
12.1+No Apple SiliconGPU RequiredQuant: BF16, FP8, GGUF

FP8 fits on 24 GB; full BF16 needs an A100 / H100 class card.

Screenshot placeholder · Mochi 1

What is Mochi 1?

Mochi 1 is Genmo's flagship open-weight text-to-video model. 10B parameters, Apache 2.0 license, generates 5.4-second 480p clips with motion fluidity that approaches closed models. Steep VRAM requirement at full precision; FP8 / GGUF ports run on a single 24 GB consumer card.

Pros & cons

Pros

  • Best fluid motion of any open-weight video model at launch
  • Apache 2.0 — no Community License revenue clauses
  • FP8 / GGUF ports drop it onto consumer GPUs

Cons

  • Full BF16 needs 60+ GB VRAM (datacenter-class)
  • 5.4-second clip cap is short

What's actually free?

Apache 2.0 — fully commercial-friendly.

✓ Actually FreeNo SignupOpen SourceWatermark-Free

Alternatives

Wan 2.2

Open-weight video diffusion from Alibaba.

Open Source 12–48 GB VRAM
Min VRAM
12 GB
GPU class
Workstation GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

HunyuanVideo

13B open-weight cinematic text-to-video.

Open Source 24–48 GB VRAM
Min VRAM
24 GB
GPU class
Workstation GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

LTX-Video

Real-time-ish open video diffusion from Lightricks.

Open Source 12–16 GB VRAM
Min VRAM
12 GB
GPU class
High-end GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

CogVideoX 5B

Open-source text-to-video diffusion from THUDM.

Open Source 12–24 GB VRAM
Min VRAM
12 GB
GPU class
High-end GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free