Claude 4.5 Haiku (Reasoning) vs DeepSeek V3.1 (Reasoning)

Anthropic vs DeepSeek — side-by-side benchmark comparison

	Claude 4.5 Haiku (Reasoning)	DeepSeek V3.1 (Reasoning)
Intelligence Index	37.1	27.7
Coding Index	32.6	29.7
Math Index	83.7	89.7
Output speed (tok/s)	142.2	0.0
Blended price ($/1M)	$2.19	$0.86
Time to first token (s)	10.48s	0.00s
aime	—	—
aime 25	83.7%	89.7%
artificial analysis coding index	32.60	29.70
artificial analysis intelligence index	37.10	27.70
artificial analysis math index	83.70	89.70
gpqa	67.2%	77.9%
hle	9.7%	13.0%
ifbench	54.3%	41.5%
lcr	70.3%	53.3%
livecodebench	61.5%	78.4%
math 500	—	—
mmlu pro	76.0%	85.1%
scicode	43.3%	39.1%
tau2	54.7%	37.4%
terminalbench hard	27.3%	25.0%

Benchmark data from Artificial Analysis.