← All comparisons

DeepHermes 3 - Mistral 24B Preview (Non-reasoning) vs Claude 3.7 Sonnet (Non-reasoning)

Nous Research vs Anthropic — side-by-side benchmark comparison

DeepHermes 3 - Mistral 24B Preview (Non-reasoning)Claude 3.7 Sonnet (Non-reasoning)
Intelligence Index10.930.8
Coding Index26.7
Math Index21.0
Output speed (tok/s)0.00.0
Blended price ($/1M)$0.00$6.56
Time to first token (s)0.00s0.00s
aime4.7%22.3%
aime 2521.0%
artificial analysis coding index26.70
artificial analysis intelligence index10.9030.80
artificial analysis math index21.00
gpqa38.2%65.6%
hle3.9%4.8%
ifbench44.0%
lcr48.3%
livecodebench19.5%39.4%
math 50059.5%85.0%
mmlu pro58.0%80.3%
scicode22.8%37.6%
tau250.0%
terminalbench hard21.2%

Benchmark data from Artificial Analysis.