← All comparisons

Hermes 4 - Llama-3.1 405B (Non-reasoning) vs Hermes 4 - Llama-3.1 70B (Non-reasoning)

Nous Research vs Nous Research — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 405B (Non-reasoning)Hermes 4 - Llama-3.1 70B (Non-reasoning)
Intelligence Index17.612.6
Coding Index18.19.2
Math Index15.311.3
Output speed (tok/s)40.894.3
Blended price ($/1M)$1.50$0.20
Time to first token (s)0.73s0.61s
aime
aime 2515.3%11.3%
artificial analysis coding index18.109.20
artificial analysis intelligence index17.6012.60
artificial analysis math index15.3011.30
gpqa53.6%49.1%
hle4.2%3.6%
ifbench34.8%29.0%
lcr20.0%2.0%
livecodebench54.6%26.9%
math 500
mmlu pro72.9%66.4%
scicode34.6%27.7%
tau226.6%21.6%
terminalbench hard9.8%0.0%

Benchmark data from Artificial Analysis.