← All comparisons

Hermes 4 - Llama-3.1 70B (Reasoning) vs Llama 3.3 Nemotron Super 49B v1 (Reasoning)

Nous Research vs NVIDIA — side-by-side benchmark comparison

Hermes 4 - Llama-3.1 70B (Reasoning)Llama 3.3 Nemotron Super 49B v1 (Reasoning)
Intelligence Index16.018.5
Coding Index14.49.4
Math Index68.754.7
Output speed (tok/s)92.80.0
Blended price ($/1M)$0.20$0.00
Time to first token (s)0.64s0.00s
aime58.3%
aime 2568.7%54.7%
artificial analysis coding index14.409.40
artificial analysis intelligence index16.0018.50
artificial analysis math index68.7054.70
gpqa69.9%64.3%
hle7.9%6.5%
ifbench31.3%38.1%
lcr6.7%17.0%
livecodebench65.3%27.7%
math 50095.9%
mmlu pro81.1%78.5%
scicode34.1%28.2%
tau222.5%26.9%
terminalbench hard4.5%0.0%

Benchmark data from Artificial Analysis.