← model directory

qwen

Qwen: Qwen3 8B

qwen/qwen3-8b

↓ runs free on your own hardware

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

specs & pricing
Type
text
Provider
qwen
Model ID
qwen/qwen3-8b
Capabilities
tools, reasoning
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.055 / 1M tokens
Output price
$0.44 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.