Claude 4.1 Opus (Non-reasoning) vs DeepSeek R1 Distill Qwen 14B

Anthropic vs DeepSeek — side-by-side benchmark comparison

Metric	Claude 4.1 Opus (Non-reasoning)	DeepSeek R1 Distill Qwen 14B
Intelligence Index	36.0	15.8
Coding Index	—	—
Math Index	—	55.7
Output speed (tok/s)	44.7	—
Blended price ($/1M)	$32.81	—
Time to first token (s)	1.63s	—
AIME	—	66.7%
AIME 2025	—	55.7%
GPQA	—	48.4%
HLE	—	4.4%
IFBench	—	22.1%
LCR	—	7.0%
LiveCodeBench	—	37.6%
MATH-500	—	94.9%
MMLU-Pro	—	74.0%
SciCode	—	23.9%
τ²-Bench	—	—
Terminal-Bench Hard	—	—

Benchmark data from Artificial Analysis.