DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) vs GPT-4o mini

Nous Research vs OpenAI — side-by-side benchmark comparison

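| Metric | DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) | GPT-4o mini |
| --- | --- | --- |
| Intelligence Index | 7.6 | 12.6 |
| Coding Index | - | - |
| Output speed (tok/s) | 0.0 | 69.6 |
| Blended price ($/1M) | $0.00 | $0.26 |
| Time to first token (s) | 0.00 | 0.62 |
| AIME | 0.0% | 11.7% |
| Artificial Analysis Coding Index | - | - |
| Artificial Analysis Intelligence Index | 7.60 | 12.60 |
| GPQA | 27.0% | 42.6% |
| HLE | 4.3% | 4.0% |
| LCR | - | - |
| LiveCodeBench | 8.5% | 23.4% |
| MATH 500 | 21.8% | 78.9% |
| MMLU Pro | 36.5% | 64.8% |
| SciCode | 9.1% | 22.9% |
| tau2 | - | - |
| TerminalBench Hard | - | - |

Metrics with a single reported value (the source does not show which model it belongs to):

Math Index: 14.7
AIME 25: 14.7%
Artificial Analysis Math Index: 14.70
IFBench: 31.0%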

Benchmark data from Artificial Analysis.