Skip to content
AI Tools Finder

NVIDIA Cosmos

World-foundation models for physical AI.

Open Source 24–80 GB VRAMRuns locally
Actually FreeWatermark-FreeAPI
Visit NVIDIA CosmosUpdated 2026-05-12 · Direct link

Hardware requirements

Runs locally · Datacenter GPU (80 GB+)

24–80 GB VRAM
Min VRAM
24 GB
Rec. VRAM
80 GB
Min RAM
32 GB
Rec. RAM
64 GB
Disk
80 GB
GPU class
Datacenter GPU
12.0+No Apple SiliconGPU RequiredQuant: FP16, BF16

4B variant on 24 GB; 14B + tokenizer pipeline needs H100-class.

Screenshot placeholder · NVIDIA Cosmos

What is NVIDIA Cosmos?

Cosmos is NVIDIA's open foundation-model family for "physical AI" — robotics, autonomous vehicles, simulation. Includes diffusion and autoregressive video generators (Cosmos-Predict), tokenizers, and guardrails. Aimed at synthetic-data generation for physical-world ML rather than artistic video.

Pros & cons

Pros

  • Purpose-built for synthetic data in robotics / AV pipelines
  • Physics-aware video generation (camera motion, object permanence)
  • Full pipeline: tokenizer, generator, guardrails
  • Open weights at multiple sizes (4B → 14B)

Cons

  • Hardware floor is high — 48 GB+ for the larger variants
  • Not aimed at general creative video — different model than Wan/Hunyuan
  • Licence is open but not OSI-approved

What's actually free?

NVIDIA Open Model License — permissive but not OSI-OSS.

✓ Actually FreeWatermark-Free

Alternatives

Wan 2.2

Open-weight video diffusion from Alibaba.

Open Source 12–48 GB VRAM
Min VRAM
12 GB
GPU class
Workstation GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

HunyuanVideo

13B open-weight cinematic text-to-video.

Open Source 24–48 GB VRAM
Min VRAM
24 GB
GPU class
Workstation GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free

LTX-Video

Real-time-ish open video diffusion from Lightricks.

Open Source 12–16 GB VRAM
Min VRAM
12 GB
GPU class
High-end GPU
Quant
FP16
Actually FreeNo SignupOpen SourceWatermark-Free