qwen

Qwen: Qwen3 VL 32B Instruct

qwen/qwen3-vl-32b-instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

specs & pricing
Type: text
Provider: qwen
Model ID: qwen/qwen3-vl-32b-instruct
Capabilities: vision, tools
Context window: 262K tokens
Self-hostable: Yes, runs on your own GPU
Input price: $0.11 / 1M tokens
Output price: $0.46 / 1M tokens

The cloud prices apply only when a request fails over to the cloud, and are billed from prepaid credits. Open-weight models run free on GPUs you own: the gateway routes to your nodes first.
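
As a rough illustration of the listed rates, the sketch below estimates the charge for a single request that fails over to the cloud. It assumes billing is strictly linear per token, with no minimum charge, rounding step, or separate rate for image or video input; the page does not state any of that. The helper name `estimate_cloud_cost` is hypothetical and not part of any gateway API.

```python
from decimal import Decimal

# Listed cloud prices in USD per 1M tokens (taken from the specs on this page).
INPUT_PRICE_PER_M = Decimal("0.11")
OUTPUT_PRICE_PER_M = Decimal("0.46")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cloud_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD charge for one cloud-served request.

    Hypothetical helper: assumes purely linear per-token billing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


# Example: a 200K-token prompt with a 4K-token reply.
cost = estimate_cloud_cost(200_000, 4_000)
print(f"${cost:.4f}")
```

Requests served by your own nodes do not incur these charges, so an estimate like this only matters for the share of traffic that actually fails over.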

Qwen: Qwen3 VL 32B Instruct — pricing, context & specs | Wide Area Intelligence