GPT-4o Realtime (Dec '24) vs Grok 4.20 0309 (Reasoning)

OpenAI vs xAI — side-by-side benchmark comparison

	GPT-4o Realtime (Dec '24)	Grok 4.20 0309 (Reasoning)
Intelligence Index	—	48.5
Coding Index	—	42.2
Math Index	—	—
Output speed (tok/s)	0.0	217.8
Blended price ($/1M)	$0.00	$3.00
Time to first token (s)	0.00s	13.18s
aime	—	—
aime 25	—	—
artificial analysis coding index	—	42.20
artificial analysis intelligence index	—	48.50
artificial analysis math index	—	—
gpqa	—	88.5%
hle	—	30.0%
ifbench	—	82.9%
lcr	—	59.0%
livecodebench	—	—
math 500	—	—
mmlu pro	—	—
scicode	—	44.7%
tau2	—	96.5%
terminalbench hard	—	40.9%

Benchmark data from Artificial Analysis.