← model directory

z-ai

Z.ai: GLM 4.6V

z-ai/glm-4.6v

↓ runs free on your own hardware

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

specs & pricing
Type
text
Provider
z-ai
Model ID
z-ai/glm-4.6v
Capabilities
vision, tools, reasoning
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.33 / 1M tokens
Output price
$0.99 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.