← All comparisons

Hermes 4 - Llama-3.1 70B (Non-reasoning) vs Mistral Large (Feb '24)

Nous Research vs Mistral — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 70B (Non-reasoning)Mistral Large (Feb '24)
Intelligence Index12.69.9
Coding Index9.2
Math Index11.3
Output speed (tok/s)94.30.0
Blended price ($/1M)$0.20$6.00
Time to first token (s)0.61s0.00s
aime0.0%
aime 2511.3%
artificial analysis coding index9.20
artificial analysis intelligence index12.609.90
artificial analysis math index11.30
gpqa49.1%35.1%
hle3.6%3.4%
ifbench29.0%
lcr2.0%
livecodebench26.9%17.8%
math 50052.7%
mmlu pro66.4%51.5%
scicode27.7%20.8%
tau221.6%
terminalbench hard0.0%

Benchmark data from Artificial Analysis.