← model directory
qwen
Qwen: Qwen3 8B
qwen/qwen3-8b
↓ runs free on your own hardwareQwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...
specs & pricing
- Type
- text
- Provider
- qwen
- Model ID
- qwen/qwen3-8b
- Capabilities
- tools, reasoning
- Context window
- 131K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.055 / 1M tokens
- Output price
- $0.44 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.