← All comparisons

Hermes 4 - Llama-3.1 405B (Reasoning) vs DeepSeek R1 Distill Llama 70B

Nous Research vs DeepSeek — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 405B (Reasoning)DeepSeek R1 Distill Llama 70B
Intelligence Index18.616.0
Coding Index16.011.4
Math Index69.753.7
Output speed (tok/s)38.646.8
Blended price ($/1M)$1.50$0.79
Time to first token (s)0.79s0.33s
aime67.0%
aime 2569.7%53.7%
artificial analysis coding index16.0011.40
artificial analysis intelligence index18.6016.00
artificial analysis math index69.7053.70
gpqa72.7%40.2%
hle10.3%6.1%
ifbench32.7%27.6%
lcr20.7%11.0%
livecodebench68.6%26.6%
math 50093.5%
mmlu pro82.9%79.5%
scicode25.2%31.3%
tau222.2%21.9%
terminalbench hard11.4%1.5%

Benchmark data from Artificial Analysis.