Hermes 4 - Llama-3.1 405B (Reasoning) vs Qwen3.5 Omni Plus

Nous Research vs Alibaba — side-by-side benchmark comparison

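| Metric | Hermes 4 - Llama-3.1 405B (Reasoning) | Qwen3.5 Omni Plus |
| --- | --- | --- |
| Intelligence Index | 18.6 | 38.6 |
| Coding Index | 16.0 | 27.6 |
| Output speed (tok/s) | 38.6 | 54.9 |
| Blended price ($/1M) | $1.50 | $1.50 |
| Time to first token (s) | 0.79 | 1.29 |

Math Index: 69.7 (only one value is listed; the source does not show which model it belongs to)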
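
| Benchmark | Hermes 4 - Llama-3.1 405B (Reasoning) | Qwen3.5 Omni Plus |
| --- | --- | --- |
| artificial analysis coding index | 16.00 | 27.60 |
| artificial analysis intelligence index | 18.60 | 38.60 |
| gpqa | 72.7% | 82.6% |
| hle | 10.3% | 13.9% |
| ifbench | 32.7% | 51.2% |
| lcr | 20.7% | 52.7% |
| scicode | 25.2% | 40.5% |
| tau2 | 22.2% | 88.3% |
| terminalbench hard | 11.4% | 21.2% |

Benchmarks listed with a single value (the source does not show which model it belongs to): aime 25 69.7%, artificial analysis math index 69.70, livecodebench 68.6%, mmlu pro 82.9%.

Benchmarks listed without values: aime, math 500.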

Benchmark data from Artificial Analysis.