← All comparisons

Hermes 4 - Llama-3.1 405B (Non-reasoning) vs QwQ 32B

Nous Research vs Alibaba — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 405B (Non-reasoning)QwQ 32B
Intelligence Index17.619.7
Coding Index18.1
Math Index15.329.0
Output speed (tok/s)40.831.3
Blended price ($/1M)$1.50$0.74
Time to first token (s)0.73s0.45s
aime78.0%
aime 2515.3%29.0%
artificial analysis coding index18.10
artificial analysis intelligence index17.6019.70
artificial analysis math index15.3029.00
gpqa53.6%59.3%
hle4.2%8.2%
ifbench34.8%38.8%
lcr20.0%25.0%
livecodebench54.6%63.1%
math 50095.7%
mmlu pro72.9%76.4%
scicode34.6%35.8%
tau226.6%
terminalbench hard9.8%

Benchmark data from Artificial Analysis.