Hermes 4 - Llama-3.1 405B (Reasoning) vs DeepSeek R1 Distill Qwen 1.5B

Nous Research vs DeepSeek: side-by-side benchmark comparison

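| Metric | Hermes 4 - Llama-3.1 405B (Reasoning) | DeepSeek R1 Distill Qwen 1.5B |
| --- | --- | --- |
| Intelligence Index | 18.6 | 9.1 |
| Math Index | 69.7 | 22.0 |
| Output speed (tok/s) | 38.6 | 0.0 |
| Blended price ($/1M tokens) | $1.50 | $0.00 |
| Time to first token (s) | 0.79 | 0.00 |
| AIME 25 | 69.7% | 22.0% |
| Artificial Analysis Intelligence Index | 18.60 | 9.10 |
| Artificial Analysis Math Index | 69.70 | 22.00 |
| GPQA | 72.7% | 9.8% |
| HLE | 10.3% | 3.3% |
| IFBench | 32.7% | 13.2% |
| LCR | 20.7% | 0.3% |
| LiveCodeBench | 68.6% | 7.0% |
| MMLU Pro | 82.9% | 26.9% |
| SciCode | 25.2% | 6.6% |

Metrics reported for only one of the two models (the source layout does not preserve which one):

| Metric | Value |
| --- | --- |
| Coding Index | 16.0 |
| Artificial Analysis Coding Index | 16.00 |
| AIME | 17.7% |
| MATH 500 | 68.7% |
| tau2 | 22.2% |
| TerminalBench Hard | 11.4% |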

Benchmark data from Artificial Analysis.