← model directory

z-ai

Z.ai: GLM 4.6

z-ai/glm-4.6

↓ runs free on your own hardware

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

specs & pricing
Type
text
Provider
z-ai
Model ID
z-ai/glm-4.6
Capabilities
tools, reasoning
Context window
203K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.47 / 1M tokens
Output price
$1.91 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.