Claude 3.5 Sonnet (June '24) vs DeepSeek R1 Distill Qwen 32B

Anthropic vs DeepSeek — side-by-side benchmark comparison

	Claude 3.5 Sonnet (June '24)	DeepSeek R1 Distill Qwen 32B
Intelligence Index	14.2	17.2
Coding Index	26.0	—
Math Index	—	63.0
Output speed (tok/s)	0.0	0.0
Blended price ($/1M)	$6.56	$0.00
Time to first token (s)	0.00s	0.00s
aime	9.7%	68.7%
aime 25	—	63.0%
artificial analysis coding index	26.00	—
artificial analysis intelligence index	14.20	17.20
artificial analysis math index	—	63.00
gpqa	56.0%	61.5%
hle	3.7%	5.5%
ifbench	—	22.9%
lcr	—	9.7%
livecodebench	—	27.0%
math 500	69.5%	94.1%
mmlu pro	75.1%	73.9%
scicode	31.6%	37.6%
tau2	—	—
terminalbench hard	—	—

Benchmark data from Artificial Analysis.