← model directory

cloudflare

nvidia/nemotron-3-120b-a12b

cloudflare/nvidia/nemotron-3-120b-a12b

↓ runs free on your own hardware

NVIDIA Nemotron 3 Super is a hybrid MoE model with leading accuracy for multi-agent applications and specialized agentic AI systems.

specs & pricing
Type
text
Provider
cloudflare
Model ID
cloudflare/nvidia/nemotron-3-120b-a12b
Context window
256K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.55 / 1M tokens
Output price
$1.65 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.