GPT-4o (March 2025, chatgpt-4o-latest) vs Hermes 3 - Llama-3.1 70B

OpenAI vs Nous Research — side-by-side benchmark comparison

| Metric | GPT-4o (March 2025, chatgpt-4o-latest) | Hermes 3 - Llama-3.1 70B |
| --- | --- | --- |
| Intelligence Index | 18.6 | 10.6 |
| Coding Index | | |
| Math Index | 25.7 | |
| Output speed (tok/s) | 0.0 | 33.2 |
| Blended price ($/1M) | $0.00 | $0.30 |
| Time to first token (s) | 0.00s | 0.38s |
| AIME | 32.7% | 2.3% |
| AIME 25 | 25.7% | |
| Artificial Analysis Coding Index | | |
| Artificial Analysis Intelligence Index | 18.60 | 10.60 |
| Artificial Analysis Math Index | 25.70 | |
| GPQA | 65.5% | 40.1% |
| HLE | 5.0% | 4.1% |
| IFBench | | |
| LCR | | |
| LiveCodeBench | 42.5% | 18.8% |
| MATH 500 | 89.3% | 53.8% |
| MMLU Pro | 80.3% | 57.1% |
| SciCode | 36.6% | 23.1% |
| Tau2 | | |
| TerminalBench Hard | | |

Benchmark data from Artificial Analysis.