← model directory

z-ai

Z.ai: GLM 4.5 Air

z-ai/glm-4.5-air

↓ runs free on your own hardware

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

specs & pricing
Type
text
Provider
z-ai
Model ID
z-ai/glm-4.5-air
Capabilities
tools, reasoning
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.14 / 1M tokens
Output price
$0.94 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.