cloudflare
meta/llama-4-scout-17b-16e-instruct
cloudflare/meta/llama-4-scout-17b-16e-instruct
Meta's Llama 4 Scout is a natively multimodal model with 17 billion active parameters and 16 experts. It uses a mixture-of-experts architecture, which Meta describes as offering industry-leading performance in text and image understanding.
specs & pricing
- Type: text
- Provider: cloudflare
- Model ID: cloudflare/meta/llama-4-scout-17b-16e-instruct
- Context window: 131K tokens
- Self-hostable: Yes (runs on your own GPU)
- Input price: $0.30 / 1M tokens
- Output price: $0.94 / 1M tokens
The cloud price applies only when a request fails over to the cloud, and it is billed from prepaid credits. Open-weight models run free on GPUs you own; the gateway routes to your nodes first.
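To make the pricing concrete, here is a minimal sketch of the per-request arithmetic using the listed input and output prices. The function name `estimate_cloud_cost` and the `failed_over` flag are hypothetical and not part of any gateway API. The sketch assumes a request served by your own nodes costs nothing and that cloud billing is strictly linear in token count.

```python
# Listed prices for this model, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 0.94


def estimate_cloud_cost(input_tokens: int, output_tokens: int,
                        failed_over: bool = True) -> float:
    """Hypothetical helper: estimated USD cost of a single request.

    Assumes requests handled by self-hosted nodes are free and that
    only requests that fail over to the cloud are billed.
    """
    if not failed_over:
        return 0.0
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# A request with 2,000 input tokens and 500 output tokens that fell
# back to the cloud: (2000 * 0.30 + 500 * 0.94) / 1e6
print(f"{estimate_cloud_cost(2_000, 500):.5f}")  # prints 0.00107

# The same request served by your own GPU node.
print(estimate_cloud_cost(2_000, 500, failed_over=False))  # prints 0.0
```

Under these assumptions, a million input tokens plus a million output tokens served from the cloud would cost about $1.24 in prepaid credits.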