← All comparisons

Qwen3.5 9B (Reasoning) vs Hermes 3 - Llama-3.1 70B

Alibaba vs Nous Research — side-by-side benchmark comparison

Qwen3.5 9B (Reasoning)Hermes 3 - Llama-3.1 70B
Intelligence Index32.410.6
Coding Index25.3
Math Index
Output speed (tok/s)69.433.2
Blended price ($/1M)$0.11$0.30
Time to first token (s)1.37s0.38s
aime2.3%
aime 25
artificial analysis coding index25.30
artificial analysis intelligence index32.4010.60
artificial analysis math index
gpqa80.6%40.1%
hle13.3%4.1%
ifbench66.7%
lcr59.0%
livecodebench18.8%
math 50053.8%
mmlu pro57.1%
scicode27.5%23.1%
tau286.8%
terminalbench hard24.2%

Benchmark data from Artificial Analysis.