← model directory

google

Google: Gemma 3 4B

google/gemma-3-4b-it

↓ runs free on your own hardware

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

specs & pricing
Type
text
Provider
google
Model ID
google/gemma-3-4b-it
Capabilities
vision
Context window
131K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.044 / 1M tokens
Output price
$0.088 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.