DeepSeek R1 Distill Llama 70B vs Grok 4.1 Fast (Reasoning)

DeepSeek vs xAI: side-by-side benchmark comparison

Metric	DeepSeek R1 Distill Llama 70B	Grok 4.1 Fast (Reasoning)
Intelligence Index	16.0	38.6
Coding Index	11.4	30.9
Math Index	53.7	89.3
Output speed (tok/s)	46.8	—
Blended price ($/1M tokens)	0.79	0.00
Time to first token (s)	0.33	—
AIME	67.0%	—
AIME 2025	53.7%	89.3%
GPQA	40.2%	85.3%
HLE (Humanity's Last Exam)	6.1%	17.6%
IFBench	27.6%	52.7%
AA-LCR	11.0%	68.0%
LiveCodeBench	26.6%	82.2%
MATH-500	93.5%	—
MMLU-Pro	79.5%	85.4%
SciCode	31.3%	44.2%
τ²-Bench	21.9%	93.3%
Terminal-Bench Hard	1.5%	24.2%
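
The per-benchmark gaps between the two models are easier to read as differences than as paired columns. The snippet below is a minimal sketch that copies the percentage scores from the table above and computes Grok 4.1 Fast minus DeepSeek R1 Distill Llama 70B in percentage points. The names `SCORES` and `score_gaps` are illustrative and do not come from Artificial Analysis, and benchmarks with a missing score are skipped rather than treated as zero.

```python
# Scores copied from the table above, in percent:
# (DeepSeek R1 Distill Llama 70B, Grok 4.1 Fast (Reasoning)).
# None marks a benchmark with no reported score for that model.
SCORES = {
    "AIME": (67.0, None),
    "AIME 2025": (53.7, 89.3),
    "GPQA": (40.2, 85.3),
    "HLE": (6.1, 17.6),
    "IFBench": (27.6, 52.7),
    "AA-LCR": (11.0, 68.0),
    "LiveCodeBench": (26.6, 82.2),
    "MATH-500": (93.5, None),
    "MMLU-Pro": (79.5, 85.4),
    "SciCode": (31.3, 44.2),
    "τ²-Bench": (21.9, 93.3),
    "Terminal-Bench Hard": (1.5, 24.2),
}


def score_gaps(scores):
    """Return {benchmark: grok - deepseek} in percentage points.

    Benchmarks where either model has no score are left out.
    """
    return {
        name: round(grok - deepseek, 1)
        for name, (deepseek, grok) in scores.items()
        if deepseek is not None and grok is not None
    }


if __name__ == "__main__":
    gaps = score_gaps(SCORES)
    # Print the largest gaps first.
    for name, gap in sorted(gaps.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{name:20s} {gap:+.1f} pp")
```

On these figures the widest gap is on τ²-Bench (71.4 points) and the narrowest is on MMLU-Pro (5.9 points). AIME and MATH-500 drop out because only one model has a reported score.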

Benchmark data from Artificial Analysis.