Hermes 4 - Llama-3.1 70B (Reasoning) vs Mi:dm K 2.5 Pro

Nous Research vs Korea Telecom — side-by-side benchmark comparison

Metric                        Hermes 4 - Llama-3.1 70B (Reasoning)   Mi:dm K 2.5 Pro
Intelligence Index            16.0                                   23.1
Coding Index                  14.4                                   12.6
Math Index                    68.7                                   76.7
Output speed (tok/s)          92.8                                   0.0
Blended price ($/1M tokens)   $0.20                                  $0.00
Time to first token (s)       0.64                                   0.00

Benchmark                                Hermes 4 - Llama-3.1 70B (Reasoning)   Mi:dm K 2.5 Pro
aime
aime 25                                  68.7%                                  76.7%
artificial analysis coding index         14.40                                  12.60
artificial analysis intelligence index   16.00                                  23.10
artificial analysis math index           68.70                                  76.70
gpqa                                     69.9%                                  70.1%
hle                                      7.9%                                   7.7%
ifbench                                  31.3%                                  49.3%
lcr                                      6.7%                                   9.0%
livecodebench                            65.3%                                  65.6%
math 500
mmlu pro                                 81.1%                                  80.9%
scicode                                  34.1%                                  33.2%
tau2                                     22.5%                                  86.5%
terminalbench hard                       4.5%                                   2.3%

Benchmark data from Artificial Analysis.