
Llama 3.3 Instruct 70B vs Hermes 4 - Llama-3.1 70B (Reasoning)

Meta vs Nous Research: side-by-side benchmark comparison

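| Metric | Llama 3.3 Instruct 70B | Hermes 4 - Llama-3.1 70B (Reasoning) |
| --- | --- | --- |
| Intelligence Index | 14.5 | 16.0 |
| Coding Index | 10.7 | 14.4 |
| Math Index | 7.7 | 68.7 |
| Output speed (tok/s) | 88.1 | 92.8 |
| Blended price ($/1M tokens) | $0.62 | $0.20 |
| Time to first token (s) | 0.59 | 0.64 |
| AIME 25 | 7.7% | 68.7% |
| Artificial Analysis Coding Index | 10.70 | 14.40 |
| Artificial Analysis Intelligence Index | 14.50 | 16.00 |
| Artificial Analysis Math Index | 7.70 | 68.70 |
| GPQA | 49.8% | 69.9% |
| HLE | 4.0% | 7.9% |
| IFBench | 47.1% | 31.3% |
| LCR | 15.0% | 6.7% |
| LiveCodeBench | 28.8% | 65.3% |
| MMLU Pro | 71.3% | 81.1% |
| SciCode | 26.0% | 34.1% |
| tau2 | 26.6% | 22.5% |
| TerminalBench Hard | 3.0% | 4.5% |

Reported for only one of the two models: AIME 30.0%, MATH 500 77.3%.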
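
The speed, price, and time-to-first-token figures above can be combined into a rough per-request estimate. The sketch below is illustrative only: the `ModelStats` class and `estimate` function are hypothetical names, not part of any Artificial Analysis tooling. It assumes the blended price applies uniformly to every token in the request, and that response time is simply time to first token plus output tokens divided by output speed. It does not account for a reasoning model emitting extra tokens before its final answer, which would raise both cost and latency.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """Figures copied from the comparison table (hypothetical container)."""
    name: str
    tokens_per_second: float   # output speed (tok/s)
    price_per_million: float   # blended price in USD per 1M tokens
    ttft_seconds: float        # time to first token (s)


def estimate(model: ModelStats, total_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (cost in USD, response time in seconds) for one request.

    Assumption: the blended price is charged on all tokens alike, and
    output streams at a constant rate after the first token arrives.
    """
    cost = total_tokens / 1_000_000 * model.price_per_million
    latency = model.ttft_seconds + output_tokens / model.tokens_per_second
    return cost, latency


LLAMA = ModelStats("Llama 3.3 Instruct 70B", 88.1, 0.62, 0.59)
HERMES = ModelStats("Hermes 4 - Llama-3.1 70B (Reasoning)", 92.8, 0.20, 0.64)

if __name__ == "__main__":
    # Example request: 2,000 tokens in total, 500 of them generated output.
    for model in (LLAMA, HERMES):
        cost, latency = estimate(model, total_tokens=2_000, output_tokens=500)
        print(f"{model.name}: ${cost:.5f}, {latency:.2f} s")
```

Under these assumptions the per-token price difference dominates the cost estimate, while the small gaps in output speed and time to first token largely offset each other in the latency estimate.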

Benchmark data from Artificial Analysis.