Mistral Large 3 vs Grok 4.20 0309 v2 (Non-reasoning)

Mistral vs xAI — side-by-side benchmark comparison

	Mistral Large 3	Grok 4.20 0309 v2 (Non-reasoning)
Intelligence Index	22.8	29.0
Coding Index	22.7	22.0
Math Index	38.0	—
Output speed (tok/s)	62.3	175.2
Blended price ($/1M)	$0.75	$3.00
Time to first token (s)	0.57s	0.47s
aime	—	—
aime 25	38.0%	—
artificial analysis coding index	22.70	22.00
artificial analysis intelligence index	22.80	29.00
artificial analysis math index	38.00	—
gpqa	68.0%	77.6%
hle	4.1%	24.2%
ifbench	36.2%	49.3%
lcr	34.7%	17.3%
livecodebench	46.5%	—
math 500	—	—
mmlu pro	80.7%	—
scicode	36.2%	32.8%
tau2	24.6%	59.9%
terminalbench hard	15.9%	16.7%

Benchmark data from Artificial Analysis.