GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE →

DeepSeek cloud models

31 models tracked via Artificial Analysis. Compare cloud performance, then find local GGUF versions in the GraySoft model catalog.

ModelIntelligenceSpeed (tok/s)
DeepSeek V4 Pro (Reasoning, Max Effort)51.576.576
DeepSeek V4 Pro (Reasoning, High Effort)49.874.214
DeepSeek V4 Flash (Reasoning, Max Effort)46.596.704
DeepSeek V4 Flash (Reasoning, High Effort)460
DeepSeek V3.2 (Reasoning)41.70
DeepSeek V4 Pro (Non-reasoning)39.368.599
DeepSeek V4 Flash (Non-reasoning)36.589.443
DeepSeek V3.1 Terminus (Reasoning)33.90
DeepSeek V3.2 Exp (Reasoning)32.90
DeepSeek V3.2 (Non-reasoning)32.10
DeepSeek V3.2 Speciale29.40
DeepSeek V3.1 Terminus (Non-reasoning)28.50
DeepSeek V3.2 Exp (Non-reasoning)28.40
DeepSeek V3.1 (Non-reasoning)28.10
DeepSeek V3.1 (Reasoning)27.70
DeepSeek R1 0528 (May '25)27.10
DeepSeek V3 032422.30
DeepSeek R1 (Jan '25)18.80
DeepSeek R1 Distill Qwen 32B17.20
DeepSeek V3 (Dec '24)16.50
DeepSeek R1 0528 Qwen3 8B16.40
DeepSeek R1 Distill Llama 70B1644.425
DeepSeek R1 Distill Qwen 14B15.80
DeepSeek-V2.5 (Dec '24)12.50
DeepSeek-V2.512.30
DeepSeek R1 Distill Llama 8B12.10
DeepSeek-Coder-V210.60
DeepSeek R1 Distill Qwen 1.5B9.10
DeepSeek-V2-Chat9.10
DeepSeek Coder V2 Lite Instruct8.50
DeepSeek LLM 67B Chat (V1)8.40

Run models locally with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE → · Browse 524k+ models · Compare models