← model directory

z-ai

Z.ai: GLM 4.5

z-ai/glm-4.5

↓ runs free on your own hardware

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

specs & pricing
Type
text
Provider
z-ai
Model ID
z-ai/glm-4.5
Capabilities
tools, reasoning
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.66 / 1M tokens
Output price
$2.42 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.

Z.ai: GLM 4.5 — pricing, context & specs | Wide Area Intelligence