← model directory
cloudflare
ibm-granite/granite-4.0-h-micro
cloudflare/ibm-granite/granite-4.0-h-micro
↓ runs free on your own hardwareGranite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.
specs & pricing
- Type
- text
- Provider
- cloudflare
- Model ID
- cloudflare/ibm-granite/granite-4.0-h-micro
- Context window
- 131K tokens
- Self-hostable
- Yes — runs on your own GPU
- Input price
- $0.019 / 1M tokens
- Output price
- $0.12 / 1M tokens
Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.