Hermes 4 - Llama-3.1 405B (Reasoning) vs Hermes 3 - Llama-3.1 70B

Side-by-side benchmark comparison of two Nous Research models

Metric: Hermes 4 - Llama-3.1 405B (Reasoning) / Hermes 3 - Llama-3.1 70B
(A row with a single value has a result for only one of the two models.)

Intelligence Index: 18.6 / 10.6
Coding Index: 16.0
Math Index: 69.7
Output speed (tok/s): 38.6 / 33.2
Blended price ($/1M): $1.50 / $0.30
Time to first token (s): 0.79 / 0.38
AIME: 2.3%
AIME 25: 69.7%
Artificial Analysis Coding Index: 16.00
Artificial Analysis Intelligence Index: 18.60 / 10.60
Artificial Analysis Math Index: 69.70
GPQA: 72.7% / 40.1%
HLE: 10.3% / 4.1%
IFBench: 32.7%
LCR: 20.7%
LiveCodeBench: 68.6% / 18.8%
MATH 500: 53.8%
MMLU Pro: 82.9% / 57.1%
SciCode: 25.2% / 23.1%
Tau2: 22.2%
TerminalBench Hard: 11.4%

Benchmark data from Artificial Analysis.