← model directory

cloudflare

deepseek-ai/deepseek-r1-distill-qwen-32b

cloudflare/deepseek-ai/deepseek-r1-distill-qwen-32b

↓ runs free on your own hardware

DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

specs & pricing
Type
text
Provider
cloudflare
Model ID
cloudflare/deepseek-ai/deepseek-r1-distill-qwen-32b
Context window
80K tokens
Self-hostable
Yes — runs on your own GPU
Input price
$0.55 / 1M tokens
Output price
$5.37 / 1M tokens

Cloud price is billed from prepaid credits when a request fails over to the cloud. Open-weight models run free on GPUs you own — the gateway routes to your nodes first.