← model directory

qwen

Qwen: Qwen3 VL 8B Thinking

qwen/qwen3-vl-8b-thinking

↓ runs free on your own hardware

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

specs & pricing
Type
text
Provider
qwen
Model ID
qwen/qwen3-vl-8b-thinking
Capabilities
vision, tools, reasoning
Context window
256K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.13 / 1M tokens
Output price
$1.50 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.