← All comparisons

Hermes 3 - Llama-3.1 70B vs Qwen3.5 27B (Non-reasoning)

Nous Research vs Alibaba — side-by-side benchmark comparison

Hermes 3 - Llama-3.1 70BQwen3.5 27B (Non-reasoning)
Intelligence Index10.637.2
Coding Index33.4
Math Index
Output speed (tok/s)33.295.3
Blended price ($/1M)$0.30$0.88
Time to first token (s)0.38s1.40s
aime2.3%
aime 25
artificial analysis coding index33.40
artificial analysis intelligence index10.6037.20
artificial analysis math index
gpqa40.1%84.2%
hle4.1%13.2%
ifbench46.9%
lcr55.7%
livecodebench18.8%
math 50053.8%
mmlu pro57.1%
scicode23.1%36.7%
tau287.1%
terminalbench hard31.8%

Benchmark data from Artificial Analysis.