← All comparisons

Qwen3.5 397B A17B (Reasoning) vs Hermes 3 - Llama-3.1 70B

Alibaba vs Nous Research — side-by-side benchmark comparison

Qwen3.5 397B A17B (Reasoning)Hermes 3 - Llama-3.1 70B
Intelligence Index45.010.6
Coding Index41.3
Math Index
Output speed (tok/s)52.133.2
Blended price ($/1M)$1.35$0.30
Time to first token (s)1.81s0.38s
aime2.3%
aime 25
artificial analysis coding index41.30
artificial analysis intelligence index45.0010.60
artificial analysis math index
gpqa89.3%40.1%
hle27.3%4.1%
ifbench78.8%
lcr65.7%
livecodebench18.8%
math 50053.8%
mmlu pro57.1%
scicode42.0%23.1%
tau295.6%
terminalbench hard40.9%

Benchmark data from Artificial Analysis.