← model directory

z-ai

Z.ai: GLM 4 32B

z-ai/glm-4-32b

↓ runs free on your own hardware

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

specs & pricing
Type
text
Provider
z-ai
Model ID
z-ai/glm-4-32b
Capabilities
tools
Context window
128K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.11 / 1M tokens
Output price
$0.11 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.

Z.ai: GLM 4 32B — pricing, context & specs | Wide Area Intelligence