Google: Gemma 3 4B
google/gemma-3-4b-it
Runs free on your own hardware
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
specs & pricing
- Type: text
- Provider: Google
- Model ID: google/gemma-3-4b-it
- Capabilities: vision
- Context window: 131K tokens
- Self-hostable: Yes, runs on your own GPU
- Input price: $0.044 / 1M tokens
- Output price: $0.088 / 1M tokens
The cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own; the gateway routes to your nodes first.
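As a rough illustration of the per-token pricing, the sketch below estimates what a single request might cost if it fails over to the cloud. It uses only the listed input and output prices. The function name `estimate_cloud_cost` and the example token counts are hypothetical, and it assumes billing is strictly proportional to token counts with no minimums or rounding, which the page does not confirm.

```python
from decimal import Decimal

# Listed cloud prices for google/gemma-3-4b-it, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.044")
OUTPUT_PRICE_PER_M = Decimal("0.088")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cloud_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request served by the cloud.

    Hypothetical helper: assumes cost scales linearly with token counts.
    Requests served by your own GPU nodes are not billed at all.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: a 10,000-token prompt that produces a 2,000-token reply.
cost = estimate_cloud_cost(10_000, 2_000)
print(f"${cost:.6f}")  # prints $0.000616
```

`Decimal` is used instead of floats so that very small per-request amounts add up exactly when summed over many requests.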