Claude 3.5 Sonnet (June '24) vs Grok 4.20 0309 v2 (Non-reasoning)

Anthropic vs xAI — side-by-side benchmark comparison

Metric: Claude 3.5 Sonnet (June '24) | Grok 4.20 0309 v2 (Non-reasoning)
Intelligence Index: 14.2 | 29.0
Coding Index: 26.0 | 22.0
Math Index:
Output speed (tok/s): 0.0 | 175.2
Blended price ($/1M): $6.56 | $3.00
Time to first token (s): 0.00 | 0.47
aime: 9.7%
aime 25:
artificial analysis coding index: 26.00 | 22.00
artificial analysis intelligence index: 14.20 | 29.00
artificial analysis math index:
gpqa: 56.0% | 77.6%
hle: 3.7% | 24.2%
ifbench: 49.3%
lcr: 17.3%
livecodebench:
math 500: 69.5%
mmlu pro: 75.1%
scicode: 31.6% | 32.8%
tau2: 59.9%
terminalbench hard: 16.7%

Benchmark data from Artificial Analysis.