← model directory

deepseek

DeepSeek: DeepSeek V4 Flash

deepseek/deepseek-v4-flash

↓ runs free on your own hardware

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

specs & pricing
Type
text
Provider
deepseek
Model ID
deepseek/deepseek-v4-flash
Capabilities
tools, reasoning
Context window
1.0M tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.11 / 1M tokens
Output price
$0.22 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.