
Hermes 4 - Llama-3.1 70B (Reasoning) vs K-EXAONE (Reasoning)

Nous Research vs LG AI Research — side-by-side benchmark comparison

| Metric | Hermes 4 - Llama-3.1 70B (Reasoning) | K-EXAONE (Reasoning) |
| --- | --- | --- |
| Intelligence Index | 16.0 | 32.1 |
| Coding Index | 14.4 | 27.0 |
| Math Index | 68.7 | 90.3 |
| Output speed (tok/s) | 92.8 | 0.0 |
| Blended price ($/1M) | $0.20 | $0.00 |
| Time to first token (s) | 0.64s | 0.00s |
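| Benchmark | Hermes 4 - Llama-3.1 70B (Reasoning) | K-EXAONE (Reasoning) |
| --- | --- | --- |
| aime | | |
| aime 25 | 68.7% | 90.3% |
| artificial analysis coding index | 14.40 | 27.00 |
| artificial analysis intelligence index | 16.00 | 32.10 |
| artificial analysis math index | 68.70 | 90.30 |
| gpqa | 69.9% | 78.3% |
| hle | 7.9% | 13.1% |
| ifbench | 31.3% | 64.7% |
| lcr | 6.7% | 55.7% |
| livecodebench | 65.3% | 76.8% |
| math 500 | | |
| mmlu pro | 81.1% | 83.8% |
| scicode | 34.1% | 35.6% |
| tau2 | 22.5% | 74.3% |
| terminalbench hard | 4.5% | 22.7% |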

Benchmark data from Artificial Analysis.