Modal
Serverless Python for GPU workloads.
Freemium 16–80 GB VRAM
- Min VRAM
- 16 GB
- GPU class
- Datacenter GPU
- Quant
- —
Actually FreeWatermark-FreeHobbyist-FriendlyAPI
Serverless GPU functions — deploy a Python file, get an HTTPS endpoint.
Beam treats GPU functions the way Vercel treats web functions: write Python, run `beam deploy`, get an HTTPS endpoint with autoscaling. Targeted at the long tail between Replicate (run someone else's model) and Modal (do everything in code).
Free monthly credits; pay-per-second beyond.
Serverless Python for GPU workloads.
Run any open-source model with one API call.
On-demand GPU pods for ComfyUI, vLLM, training.