meta/llama-3.3-70b-instruct-fp8-fast

Name: meta/llama-3.3-70b-instruct-fp8-fast
Brand: cloudflare
Price: 0.3223 USD

cloudflare/meta/llama-3.3-70b-instruct-fp8-fast

Llama 3.3 70B Instruct, quantized to FP8 precision and optimized for faster inference.

specs & pricing

Type: text
Provider: cloudflare
Model ID: cloudflare/meta/llama-3.3-70b-instruct-fp8-fast
Context window: 24K tokens
Self-hostable: Yes — runs on your own GPU
Input price: $0.32 / 1M tokens
Output price: $2.48 / 1M tokens

The cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own: the gateway routes requests to your nodes first.
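As a rough illustration of how the per-token prices translate into a per-request charge, here is a minimal sketch. The input and output prices are copied from the listing above. The function name `estimate_cloud_cost` is hypothetical. Reading "24K" as 24,000 tokens shared between input and output is an assumption, not something the listing states. The estimate applies only to requests that fail over to the cloud.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.32
OUTPUT_PRICE_PER_M = 2.48

# Assumption: "24K tokens" means 24,000 tokens covering input + output.
CONTEXT_WINDOW = 24_000


def estimate_cloud_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cloud cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the assumed context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 8,000 prompt tokens and 1,000 generated tokens.
    cost = estimate_cloud_cost(input_tokens=8_000, output_tokens=1_000)
    print(f"${cost:.6f}")
```

Because output tokens cost roughly eight times as much as input tokens at these prices, the 1,000 generated tokens in this example account for about half of the total.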
