Google: Gemma 4 31B
google/gemma-4-31b-it
Gemma 4 31B Instruct is Google DeepMind's 30.7B-parameter dense multimodal model, supporting text and image input with text output. It features a 256K-token context window, a configurable thinking/reasoning mode, native function...
specs & pricing
- Type: text
- Provider: Google
- Model ID: google/gemma-4-31b-it
- Capabilities: vision, tools, reasoning
- Context window: 262K tokens
- Self-hostable: Yes, runs on your own GPU
- Input price: $0.13 / 1M tokens
- Output price: $0.41 / 1M tokens
The cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own; the gateway routes to your nodes first.
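As a rough illustration of the pricing arithmetic, the sketch below estimates what a single request would cost at the listed cloud rates. It assumes simple linear per-token billing with no rounding, minimums, or other fees, which this page does not confirm. The function names and the `served_locally` flag are hypothetical and are not part of any gateway API.

```python
# Cloud rates copied from the spec list, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.13
OUTPUT_PRICE_PER_M = 0.41
TOKENS_PER_MILLION = 1_000_000


def cloud_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cloud charge for one request (assumes linear billing)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / TOKENS_PER_MILLION


def request_cost_usd(input_tokens: int, output_tokens: int, served_locally: bool) -> float:
    """Hypothetical helper: a request served by your own GPU node costs nothing;
    one that fails over to the cloud is charged at the cloud rates."""
    if served_locally:
        return 0.0
    return cloud_cost_usd(input_tokens, output_tokens)


# Example: a 100,000-token prompt with a 2,000-token reply that failed over to the cloud.
print(f"{request_cost_usd(100_000, 2_000, served_locally=False):.5f}")  # 0.01382
```

Under these assumptions, even a prompt that fills a large part of the context window costs a few cents in the cloud, and the same request costs nothing when the gateway routes it to a node you own.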