← model directory
cloudflare
qwen/qwen3-30b-a3b-fp8
cloudflare/qwen/qwen3-30b-a3b-fp8
↓ runs free on your own hardwareQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.
specs & pricing
- Type
- text
- Provider
- cloudflare
- Model ID
- cloudflare/qwen/qwen3-30b-a3b-fp8
- Context window
- 33K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.056 / 1M tokens
- Output price
- $0.37 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.