cloudflare
nvidia/nemotron-3-120b-a12b
cloudflare/nvidia/nemotron-3-120b-a12b
Runs free on your own hardware
NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.
specs & pricing
- Type: text
- Provider: cloudflare
- Model ID: cloudflare/nvidia/nemotron-3-120b-a12b
- Context window: 256K tokens
- Self-hostable: Yes, runs on your own GPU
- Input price: $0.55 / 1M tokens
- Output price: $1.65 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.
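To make the pricing concrete, here is a minimal sketch of the per-request arithmetic, assuming the charge is simply input tokens times the input price plus output tokens times the output price. The function name `cloud_cost_usd` and the `served_locally` flag are hypothetical names used for illustration and are not part of any gateway API; actual billing may round or meter differently.

```python
from decimal import Decimal

# Prices taken from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.55")
OUTPUT_PRICE_PER_M = Decimal("1.65")
TOKENS_PER_M = Decimal(1_000_000)


def cloud_cost_usd(input_tokens: int, output_tokens: int,
                   served_locally: bool = False) -> Decimal:
    """Estimate the prepaid-credit charge for a single request.

    served_locally is a hypothetical flag: True means one of your own
    GPU nodes handled the request, which the listing describes as free.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if served_locally:
        return Decimal("0")
    # Assumed linear pricing: tokens * (price per 1M tokens) / 1M.
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # A 200K-token prompt with a 4K-token reply that failed over to the cloud.
    print(cloud_cost_usd(200_000, 4_000))
```

Under these assumptions, a request with 200,000 input tokens and 4,000 output tokens that fails over to the cloud would cost about $0.1166, while the same request served by your own node would cost nothing.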