← All comparisons

GPT-5.4 nano (medium) vs Hermes 4 - Llama-3.1 405B (Non-reasoning)

OpenAI vs Nous Research — side-by-side benchmark comparison

GPT-5.4 nano (medium)Hermes 4 - Llama-3.1 405B (Non-reasoning)
Intelligence Index38.117.6
Coding Index35.018.1
Math Index15.3
Output speed (tok/s)152.640.8
Blended price ($/1M)$0.46$1.50
Time to first token (s)2.09s0.73s
aime
aime 2515.3%
artificial analysis coding index35.0018.10
artificial analysis intelligence index38.1017.60
artificial analysis math index15.30
gpqa76.1%53.6%
hle14.7%4.2%
ifbench64.4%34.8%
lcr57.3%20.0%
livecodebench54.6%
math 500
mmlu pro72.9%
scicode38.4%34.6%
tau252.6%26.6%
terminalbench hard33.3%9.8%

Benchmark data from Artificial Analysis.