← All comparisons

Hermes 4 - Llama-3.1 70B (Reasoning) vs Qwen3.5 Omni Plus

Nous Research vs Alibaba — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 70B (Reasoning)Qwen3.5 Omni Plus
Intelligence Index16.038.6
Coding Index14.427.6
Math Index68.7
Output speed (tok/s)92.854.9
Blended price ($/1M)$0.20$1.50
Time to first token (s)0.64s1.29s
aime
aime 2568.7%
artificial analysis coding index14.4027.60
artificial analysis intelligence index16.0038.60
artificial analysis math index68.70
gpqa69.9%82.6%
hle7.9%13.9%
ifbench31.3%51.2%
lcr6.7%52.7%
livecodebench65.3%
math 500
mmlu pro81.1%
scicode34.1%40.5%
tau222.5%88.3%
terminalbench hard4.5%21.2%

Benchmark data from Artificial Analysis.