← All comparisons

Hermes 4 - Llama-3.1 405B (Non-reasoning) vs Gemini 2.5 Pro Preview (Mar' 25)

Nous Research vs Google — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 405B (Non-reasoning)Gemini 2.5 Pro Preview (Mar' 25)
Intelligence Index17.630.3
Coding Index18.146.7
Math Index15.3
Output speed (tok/s)40.80.0
Blended price ($/1M)$1.50$0.00
Time to first token (s)0.73s0.00s
aime87.0%
aime 2515.3%
artificial analysis coding index18.1046.70
artificial analysis intelligence index17.6030.30
artificial analysis math index15.30
gpqa53.6%83.6%
hle4.2%17.1%
ifbench34.8%
lcr20.0%
livecodebench54.6%77.8%
math 50098.0%
mmlu pro72.9%85.8%
scicode34.6%39.5%
tau226.6%
terminalbench hard9.8%

Benchmark data from Artificial Analysis.