← model directory

cloudflare

qwen/qwen3-30b-a3b-fp8

cloudflare/qwen/qwen3-30b-a3b-fp8

↓ runs free on your own hardware

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

specs & pricing
Type
text
Provider
cloudflare
Model ID
cloudflare/qwen/qwen3-30b-a3b-fp8
Context window
33K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.056 / 1M tokens
Output price
$0.37 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.