← All comparisons

Grok 4.3 (medium) vs DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)

xAI vs Nous Research — side-by-side benchmark comparison

Grok 4.3 (medium)DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Intelligence Index48.87.6
Coding Index35.1
Math Index
Output speed (tok/s)112.50.0
Blended price ($/1M)$1.56$0.00
Time to first token (s)17.68s0.00s
aime0.0%
aime 25
artificial analysis coding index35.10
artificial analysis intelligence index48.807.60
artificial analysis math index
gpqa89.0%27.0%
hle28.1%4.3%
ifbench83.3%
lcr65.0%
livecodebench8.5%
math 50021.8%
mmlu pro36.5%
scicode44.6%9.1%
tau291.2%
terminalbench hard30.3%

Benchmark data from Artificial Analysis.