Hermes 4 - Llama-3.1 405B (Reasoning) vs GLM-4.6V (Non-reasoning)

Nous Research vs Z AI — side-by-side benchmark comparison

| Metric | Hermes 4 - Llama-3.1 405B (Reasoning) | GLM-4.6V (Non-reasoning) |
| --- | --- | --- |
| Intelligence Index | 18.6 | 17.1 |
| Coding Index | 16.0 | 11.1 |
| Math Index | 69.7 | 26.3 |
| Output speed (tok/s) | 38.6 | 38.5 |
| Blended price ($/1M) | $1.50 | $0.45 |
| Time to first token (s) | 0.79 | 1.39 |
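| Benchmark | Hermes 4 - Llama-3.1 405B (Reasoning) | GLM-4.6V (Non-reasoning) |
| --- | --- | --- |
| aime | | |
| aime 25 | 69.7% | 26.3% |
| artificial analysis coding index | 16.00 | 11.10 |
| artificial analysis intelligence index | 18.60 | 17.10 |
| artificial analysis math index | 69.70 | 26.30 |
| gpqa | 72.7% | 56.6% |
| hle | 10.3% | 3.7% |
| ifbench | 32.7% | 27.9% |
| lcr | 20.7% | 12.3% |
| livecodebench | 68.6% | 41.1% |
| math 500 | | |
| mmlu pro | 82.9% | 75.2% |
| scicode | 25.2% | 27.2% |
| tau2 | 22.2% | 30.7% |
| terminalbench hard | 11.4% | 3.0% |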
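
To make the side-by-side reading easier, the percentage benchmarks listed above can be tallied with a short script. This is only an illustrative sketch: the scores are copied from the rows above, rows without values (aime, math 500) are left out, and the `SCORES` dictionary, the `leaders` function, and the short model labels are hypothetical names introduced here, not part of the Artificial Analysis data.

```python
# Scores in percent, copied from the benchmark rows above.
# Tuple order: (Hermes 4 - Llama-3.1 405B reasoning, GLM-4.6V non-reasoning).
# Rows with no reported values (aime, math 500) are omitted.
SCORES = {
    "aime 25": (69.7, 26.3),
    "gpqa": (72.7, 56.6),
    "hle": (10.3, 3.7),
    "ifbench": (32.7, 27.9),
    "lcr": (20.7, 12.3),
    "livecodebench": (68.6, 41.1),
    "mmlu pro": (82.9, 75.2),
    "scicode": (25.2, 27.2),
    "tau2": (22.2, 30.7),
    "terminalbench hard": (11.4, 3.0),
}


def leaders(scores):
    """Map each benchmark to (leading model label, margin in percentage points)."""
    result = {}
    for name, (hermes, glm) in scores.items():
        # Round to one decimal to hide floating-point noise in the subtraction.
        margin = round(abs(hermes - glm), 1)
        if hermes > glm:
            leader = "Hermes 4"
        elif glm > hermes:
            leader = "GLM-4.6V"
        else:
            leader = "tie"
        result[name] = (leader, margin)
    return result


if __name__ == "__main__":
    for name, (leader, margin) in leaders(SCORES).items():
        print(f"{name:20s} {leader:10s} +{margin:.1f} pts")
```

Under these assumptions, Hermes 4 leads on eight of the ten benchmarks with reported values, while GLM-4.6V leads on scicode and tau2.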

Benchmark data from Artificial Analysis.