GPT-5.4 nano (Non-Reasoning) vs Claude 3.5 Sonnet (June '24)

OpenAI vs Anthropic: side-by-side benchmark comparison

	GPT-5.4 nano (Non-Reasoning)	Claude 3.5 Sonnet (June '24)
Intelligence Index	24.4	14.2
Coding Index	27.9	26.0
Math Index	—	—
Output speed (tok/s)	157.4	0.0
Blended price ($/1M tokens)	0.46	6.56
Time to first token (s)	0.54	0.00
AIME	—	9.7%
AIME 2025	—	—
GPQA	55.8%	56.0%
HLE	4.2%	3.7%
IFBench	32.7%	—
LCR	24.7%	—
LiveCodeBench	—	—
MATH-500	—	69.5%
MMLU-Pro	—	75.1%
SciCode	35.2%	31.6%
tau2	34.8%	—
Terminal-Bench Hard	24.2%	—

Benchmark data from Artificial Analysis.