← All comparisons

Hermes 4 - Llama-3.1 70B (Reasoning) vs DeepSeek R1 Distill Llama 70B

Nous Research vs DeepSeek — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 70B (Reasoning)DeepSeek R1 Distill Llama 70B
Intelligence Index16.016.0
Coding Index14.411.4
Math Index68.753.7
Output speed (tok/s)92.846.8
Blended price ($/1M)$0.20$0.79
Time to first token (s)0.64s0.33s
aime67.0%
aime 2568.7%53.7%
artificial analysis coding index14.4011.40
artificial analysis intelligence index16.0016.00
artificial analysis math index68.7053.70
gpqa69.9%40.2%
hle7.9%6.1%
ifbench31.3%27.6%
lcr6.7%11.0%
livecodebench65.3%26.6%
math 50093.5%
mmlu pro81.1%79.5%
scicode34.1%31.3%
tau222.5%21.9%
terminalbench hard4.5%1.5%

Benchmark data from Artificial Analysis.