← model directory

google

Google: Gemma 4 31B

google/gemma-4-31b-it

↓ runs free on your own hardware

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

specs & pricing
Type
text
Provider
google
Model ID
google/gemma-4-31b-it
Capabilities
vision, tools, reasoning
Context window
262K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.13 / 1M tokens
Output price
$0.41 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.