Hermes 4 - Llama-3.1 405B (Reasoning) vs Qwen3.5 122B A10B (Reasoning)

Nous Research vs Alibaba — side-by-side benchmark comparison

Metric                     Hermes 4 - Llama-3.1 405B (Reasoning)    Qwen3.5 122B A10B (Reasoning)
Intelligence Index         18.6                                     41.6
Coding Index               16.0                                     34.7
Math Index                 69.7 (single value listed; model not indicated)
Output speed (tok/s)       38.6                                     146.7
Blended price ($/1M)       $1.50                                    $1.10
Time to first token (s)    0.79s                                    1.05s
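
Benchmark                                   Hermes 4 - Llama-3.1 405B (Reasoning)    Qwen3.5 122B A10B (Reasoning)
aime                                        -                                        -
aime 25                                     69.7% (single value listed; model not indicated)
artificial analysis coding index            16.00                                    34.70
artificial analysis intelligence index      18.60                                    41.60
artificial analysis math index              69.70 (single value listed; model not indicated)
gpqa                                        72.7%                                    85.7%
hle                                         10.3%                                    23.4%
ifbench                                     32.7%                                    75.7%
lcr                                         20.7%                                    66.7%
livecodebench                               68.6% (single value listed; model not indicated)
math 500                                    -                                        -
mmlu pro                                    82.9% (single value listed; model not indicated)
scicode                                     25.2%                                    42.0%
tau2                                        22.2%                                    93.6%
terminalbench hard                          11.4%                                    31.1%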

Benchmark data from Artificial Analysis.
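
To make the size of each difference easier to read, the gaps can be computed directly from the rows above that list a score for both models. The following is a minimal Python sketch, not part of the original comparison. It assumes the first value in each row belongs to Hermes 4 - Llama-3.1 405B (Reasoning) and the second to Qwen3.5 122B A10B (Reasoning), following the column order of the header. The names `SCORES`, `point_gaps`, and `largest_gap` are hypothetical helpers introduced for illustration.

```python
# Scores in percent, copied from the rows that report a value for both models.
# Assumption: tuple order is (Hermes 4 405B, Qwen3.5 122B A10B), matching the header.
SCORES = {
    "gpqa": (72.7, 85.7),
    "hle": (10.3, 23.4),
    "ifbench": (32.7, 75.7),
    "lcr": (20.7, 66.7),
    "scicode": (25.2, 42.0),
    "tau2": (22.2, 93.6),
    "terminalbench hard": (11.4, 31.1),
}


def point_gaps(scores):
    """Return {benchmark: second score minus first score} in percentage points."""
    return {name: round(b - a, 1) for name, (a, b) in scores.items()}


def largest_gap(scores):
    """Return the benchmark name with the largest absolute gap."""
    gaps = point_gaps(scores)
    return max(gaps, key=lambda name: abs(gaps[name]))


if __name__ == "__main__":
    # Print benchmarks from the widest gap to the narrowest.
    ordered = sorted(point_gaps(SCORES).items(), key=lambda kv: -abs(kv[1]))
    for name, gap in ordered:
        print(f"{name:<20} {gap:+.1f} pts")
```

Rows that list only one value are left out on purpose, since a gap cannot be computed without knowing which model the value belongs to.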