← model directory
qwen
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
↓ runs free on your own hardwareQwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
specs & pricing
- Type
- text
- Provider
- qwen
- Model ID
- qwen/qwen3-vl-32b-instruct
- Capabilities
- vision, tools
- Context window
- 262K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.11 / 1M tokens
- Output price
- $0.46 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.