GPT-4o (May '24) vs Claude 3 Haiku

OpenAI vs Anthropic — side-by-side benchmark comparison

Metric	GPT-4o (May '24)	Claude 3 Haiku
Intelligence Index	14.5	12.3
Coding Index	24.2	6.7
Math Index	—	—
Output speed (tok/s)	111.8	—
Blended price ($/1M tokens)	$7.50	$0.50
Time to first token (s)	0.61	—
AIME	11.0%	1.0%
AIME 2025	—	—
GPQA	52.6%	37.4%
HLE	2.8%	3.9%
IFBench	—	36.1%
LCR	—	21.0%
LiveCodeBench	33.4%	15.4%
MATH-500	79.1%	39.4%
MMLU-Pro	74.0%	—
SciCode	30.9%	18.6%
τ²-Bench	—	21.1%
Terminal-Bench Hard	—	0.8%
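
Several rows above have a score for only one model, so a fair comparison has to skip them rather than treat a missing value as zero. The following is a minimal Python sketch of that idea, using a few rows copied from the table. The helper names (`parse_cell`, `compare`) and the lowercase metric keys are illustrative choices, not part of the source data.

```python
from typing import Optional

MISSING = "\u2014"  # the table's placeholder for "no data"

# A few tab-separated rows copied from the table: metric, GPT-4o, Claude 3 Haiku.
ROWS = """\
gpqa\t52.6%\t37.4%
hle\t2.8%\t3.9%
ifbench\t\u2014\t36.1%
livecodebench\t33.4%\t15.4%
math 500\t79.1%\t39.4%
mmlu pro\t74.0%\t\u2014
scicode\t30.9%\t18.6%
"""


def parse_cell(cell: str) -> Optional[float]:
    """Convert a cell such as '52.6%', '$7.50' or '0.61s' to a float; None if missing."""
    cell = cell.strip()
    if cell in ("", MISSING):
        return None
    # Strip the unit characters used in the table from both ends.
    return float(cell.strip("$%s"))


def compare(table: str) -> dict:
    """Map each metric to (gpt4o, haiku, gap), where gap = gpt4o - haiku.

    The gap is None whenever either model has no reported value.
    """
    result = {}
    for line in table.splitlines():
        if not line.strip():
            continue
        name, left, right = line.split("\t")
        a, b = parse_cell(left), parse_cell(right)
        gap = None if a is None or b is None else round(a - b, 1)
        result[name.strip().lower()] = (a, b, gap)
    return result


gaps = compare(ROWS)
for name, (a, b, gap) in gaps.items():
    print(f"{name:15s} {gap if gap is not None else 'n/a'}")
```

A positive gap means GPT-4o scored higher on that benchmark. Of the rows shown here, HLE is the only one where Claude 3 Haiku leads.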

Benchmark data from Artificial Analysis.
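
The blended price row can be sanity-checked with a short calculation. The sketch below assumes the blend is a weighted average of input and output prices at a 3:1 input-to-output token ratio. It also assumes list prices of $5.00 / $15.00 per 1M tokens for GPT-4o (May '24) and $0.25 / $1.25 for Claude 3 Haiku. Neither the ratio nor the per-direction prices appear in the table above, so treat both as assumptions to verify against the providers' pricing pages.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens, assuming `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1.0)


# Assumed list prices in $ per 1M tokens (not stated in the table).
gpt4o = blended_price(5.00, 15.00)
haiku = blended_price(0.25, 1.25)

print(f"GPT-4o (May '24): ${gpt4o:.2f} per 1M tokens")
print(f"Claude 3 Haiku:   ${haiku:.2f} per 1M tokens")
print(f"Price ratio:      {gpt4o / haiku:.1f}x")
```

Under these assumptions the results match the $7.50 and $0.50 figures in the table.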