← All comparisons

Hermes 4 - Llama-3.1 70B (Non-reasoning) vs Qwen3.5 122B A10B (Non-reasoning)

Nous Research vs Alibaba — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 70B (Non-reasoning)Qwen3.5 122B A10B (Non-reasoning)
Intelligence Index12.635.9
Coding Index9.231.6
Math Index11.3
Output speed (tok/s)94.3165.9
Blended price ($/1M)$0.20$1.10
Time to first token (s)0.61s1.06s
aime
aime 2511.3%
artificial analysis coding index9.2031.60
artificial analysis intelligence index12.6035.90
artificial analysis math index11.30
gpqa49.1%82.7%
hle3.6%14.8%
ifbench29.0%50.8%
lcr2.0%56.0%
livecodebench26.9%
math 500
mmlu pro66.4%
scicode27.7%35.6%
tau221.6%84.5%
terminalbench hard0.0%29.5%

Benchmark data from Artificial Analysis.