
cloudflare

ibm-granite/granite-4.0-h-micro

cloudflare/ibm-granite/granite-4.0-h-micro


Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. Their efficiency makes them well-suited to a wide range of use cases, including retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.

specs & pricing

Type: text
Provider: cloudflare
Model ID: cloudflare/ibm-granite/granite-4.0-h-micro
Context window: 131K tokens
Self-hostable: Yes (runs on your own GPU)
Input price: $0.019 / 1M tokens
Output price: $0.12 / 1M tokens

The cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own; the gateway routes to your nodes first.
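
To make the per-token pricing concrete, here is a minimal sketch that estimates what a single request would cost if it failed over to the cloud. It uses only the input and output prices listed above; the function name `estimate_cloud_cost` and the example token counts are illustrative assumptions, not part of any gateway API.

```python
# Prices from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.019
OUTPUT_PRICE_PER_M = 0.12


def estimate_cloud_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request served by the cloud.

    Hypothetical helper for illustration only. Requests served by
    your own GPU nodes are not billed at these rates.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 100,000-token prompt with a 2,000-token reply.
cost = estimate_cloud_cost(100_000, 2_000)
print(f"${cost:.6f}")  # $0.002140
```

Output tokens cost roughly six times as much as input tokens here, so long generations are likely to dominate the bill more than long prompts do.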