← model directory

cloudflare

meta/llama-4-scout-17b-16e-instruct

cloudflare/meta/llama-4-scout-17b-16e-instruct

↓ runs free on your own hardware

Meta's Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

specs & pricing
Type
text
Provider
cloudflare
Model ID
cloudflare/meta/llama-4-scout-17b-16e-instruct
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.30 / 1M tokens
Output price
$0.94 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.