← All comparisons

Grok Beta vs Hermes 3 - Llama-3.1 70B

xAI vs Nous Research — side-by-side benchmark comparison

Grok BetaHermes 3 - Llama-3.1 70B
Intelligence Index13.310.6
Coding Index
Math Index
Output speed (tok/s)0.033.2
Blended price ($/1M)$0.00$0.30
Time to first token (s)0.00s0.38s
aime10.3%2.3%
aime 25
artificial analysis coding index
artificial analysis intelligence index13.3010.60
artificial analysis math index
gpqa47.1%40.1%
hle4.7%4.1%
ifbench
lcr
livecodebench24.1%18.8%
math 50073.7%53.8%
mmlu pro70.3%57.1%
scicode29.5%23.1%
tau2
terminalbench hard

Benchmark data from Artificial Analysis.