← All comparisons

GPT-5.4 nano (Non-Reasoning) vs Hermes 3 - Llama-3.1 70B

OpenAI vs Nous Research — side-by-side benchmark comparison

GPT-5.4 nano (Non-Reasoning)Hermes 3 - Llama-3.1 70B
Intelligence Index24.410.6
Coding Index27.9
Math Index
Output speed (tok/s)157.433.2
Blended price ($/1M)$0.46$0.30
Time to first token (s)0.54s0.38s
aime2.3%
aime 25
artificial analysis coding index27.90
artificial analysis intelligence index24.4010.60
artificial analysis math index
gpqa55.8%40.1%
hle4.2%4.1%
ifbench32.7%
lcr24.7%
livecodebench18.8%
math 50053.8%
mmlu pro57.1%
scicode35.2%23.1%
tau234.8%
terminalbench hard24.2%

Benchmark data from Artificial Analysis.