← All comparisons

GPT-5.5 Instant (May 2026) vs Hermes 4 - Llama-3.1 405B (Reasoning)

OpenAI vs Nous Research — side-by-side benchmark comparison

GPT-5.5 Instant (May 2026)Hermes 4 - Llama-3.1 405B (Reasoning)
Intelligence Index41.818.6
Coding Index45.116.0
Math Index69.7
Output speed (tok/s)0.038.6
Blended price ($/1M)$11.25$1.50
Time to first token (s)0.00s0.79s
aime
aime 2569.7%
artificial analysis coding index45.1016.00
artificial analysis intelligence index41.8018.60
artificial analysis math index69.70
gpqa84.6%72.7%
hle20.3%10.3%
ifbench71.5%32.7%
lcr55.7%20.7%
livecodebench68.6%
math 500
mmlu pro82.9%
scicode50.3%25.2%
tau249.4%22.2%
terminalbench hard42.4%11.4%

Benchmark data from Artificial Analysis.