← model directory
z-ai
Z.ai: GLM 4.6
z-ai/glm-4.6
↓ runs free on your own hardwareCompared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...
specs & pricing
- Type
- text
- Provider
- z-ai
- Model ID
- z-ai/glm-4.6
- Capabilities
- tools, reasoning
- Context window
- 203K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.47 / 1M tokens
- Output price
- $1.91 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.