Mistral Small 4 (Reasoning) vs Claude Opus 4.5 (Non-reasoning)

Mistral vs Anthropic — side-by-side benchmark comparison

Metric	Mistral Small 4 (Reasoning)	Claude Opus 4.5 (Non-reasoning)
Intelligence Index	27.8	43.1
Coding Index	24.3	42.9
Math Index	—	62.7
Output speed (tok/s)	172.9	57.0
Blended price (USD per 1M tokens)	$0.26	$10.94
Time to first token (s)	0.46	0.93
AIME	—	—
AIME 2025	—	62.7%
GPQA	76.9%	81.0%
HLE	9.5%	12.9%
IFBench	48.2%	43.0%
LCR	44.7%	65.3%
LiveCodeBench	—	73.8%
MATH-500	—	—
MMLU-Pro	—	88.9%
SciCode	38.0%	47.0%
Tau2-Bench	41.2%	86.3%
Terminal-Bench Hard	17.4%	40.9%

Benchmark data from Artificial Analysis.
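
The speed and price figures are easier to compare as ratios. The sketch below is a minimal, illustrative calculation using values copied from the table. The helper names (`ratio`, `index_points_per_dollar`) and the "index points per dollar" figure are assumptions introduced here for illustration, not metrics published by Artificial Analysis.

```python
# Values copied from the comparison table; None marks a missing ("—") entry.
MISTRAL = "Mistral Small 4 (Reasoning)"
CLAUDE = "Claude Opus 4.5 (Non-reasoning)"

table = {
    "Intelligence Index": {MISTRAL: 27.8, CLAUDE: 43.1},
    "Math Index": {MISTRAL: None, CLAUDE: 62.7},
    "Output speed": {MISTRAL: 172.9, CLAUDE: 57.0},    # tokens per second
    "Blended price": {MISTRAL: 0.26, CLAUDE: 10.94},   # USD per 1M tokens
}


def ratio(metric, numerator, denominator):
    """Return numerator/denominator for one metric, or None if a value is missing."""
    a = table[metric][numerator]
    b = table[metric][denominator]
    if a is None or b is None or b == 0:
        return None
    return a / b


def index_points_per_dollar(model):
    """Hypothetical derived figure: Intelligence Index divided by blended price."""
    return table["Intelligence Index"][model] / table["Blended price"][model]


if __name__ == "__main__":
    price = ratio("Blended price", CLAUDE, MISTRAL)
    speed = ratio("Output speed", MISTRAL, CLAUDE)
    print(f"Claude / Mistral blended price ratio: {price:.1f}x")
    print(f"Mistral / Claude output speed ratio: {speed:.2f}x")
    print(f"Math Index ratio (missing Mistral value): {ratio('Math Index', MISTRAL, CLAUDE)}")
    for model in (MISTRAL, CLAUDE):
        print(f"{model}: {index_points_per_dollar(model):.1f} index points per USD")
```

Missing entries are kept as `None` rather than zero so that an unreported benchmark is never read as a score of 0.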