← model directory
z-ai
Z.ai: GLM 4.5
z-ai/glm-4.5
↓ runs free on your own hardwareGLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...
specs & pricing
- Type
- text
- Provider
- z-ai
- Model ID
- z-ai/glm-4.5
- Capabilities
- tools, reasoning
- Context window
- 131K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.66 / 1M tokens
- Output price
- $2.42 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.