
Hermes 4 - Llama-3.1 70B (Reasoning) vs ERNIE 4.5 300B A47B

Nous Research vs Baidu — side-by-side benchmark comparison

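| Metric | Hermes 4 - Llama-3.1 70B (Reasoning) | ERNIE 4.5 300B A47B |
| --- | --- | --- |
| Intelligence Index | 16.0 | 15.0 |
| Coding Index | 14.4 | 14.5 |
| Math Index | 68.7 | 41.3 |
| Output speed (tok/s) | 92.8 | 25.7 |
| Blended price ($/1M) | $0.20 | $0.48 |
| Time to first token (s) | 0.64 | 1.68 |
| aime 25 | 68.7% | 41.3% |
| artificial analysis coding index | 14.40 | 14.50 |
| artificial analysis intelligence index | 16.00 | 15.00 |
| artificial analysis math index | 68.70 | 41.30 |
| gpqa | 69.9% | 81.1% |
| hle | 7.9% | 3.5% |
| ifbench | 31.3% | 39.1% |
| lcr | 6.7% | 2.3% |
| livecodebench | 65.3% | 46.7% |
| mmlu pro | 81.1% | 77.6% |
| scicode | 34.1% | 31.5% |
| tau2 | 22.5% | 0.0% |
| terminalbench hard | 4.5% | 6.1% |

Reported for one model only (the source does not show which):
aime: 49.3%
math 500: 93.1%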
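
As a rough aid for reading the headline figures above, the following minimal Python sketch reports which model leads on each metric and the ratio between the two values. The `higher_is_better` flags, the short model labels, and the function names are illustrative assumptions, not part of the Artificial Analysis data; only the numbers are taken from the comparison.

```python
# Headline metrics copied from the comparison above, stored as
# (Hermes 4 value, ERNIE 4.5 value, higher_is_better).
# The higher_is_better flags are an assumption about how to read each metric.
METRICS = {
    "Intelligence Index": (16.0, 15.0, True),
    "Coding Index": (14.4, 14.5, True),
    "Math Index": (68.7, 41.3, True),
    "Output speed (tok/s)": (92.8, 25.7, True),
    "Blended price ($/1M)": (0.20, 0.48, False),
    "Time to first token (s)": (0.64, 1.68, False),
}


def leader(hermes, ernie, higher_is_better):
    """Return the label of the model that leads on a metric, or 'tie'."""
    if hermes == ernie:
        return "tie"
    hermes_leads = (hermes > ernie) == higher_is_better
    return "Hermes 4" if hermes_leads else "ERNIE 4.5"


def summarize(metrics=METRICS):
    """Map each metric name to (leading model, Hermes/ERNIE ratio rounded to 2 places)."""
    return {
        name: (leader(h, e, hib), round(h / e, 2))
        for name, (h, e, hib) in metrics.items()
    }


if __name__ == "__main__":
    for name, (who, ratio) in summarize().items():
        print(f"{name}: {who} leads (Hermes/ERNIE = {ratio})")
```

The ratio is left direction-neutral on purpose: for price and time to first token a value below 1 favors Hermes 4, while for the index and speed rows a value above 1 does.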

Benchmark data from Artificial Analysis.