← All comparisons

Gemini 3.1 Flash-Lite vs Hermes 4 - Llama-3.1 70B (Non-reasoning)

Google vs Nous Research — side-by-side benchmark comparison

Gemini 3.1 Flash-LiteHermes 4 - Llama-3.1 70B (Non-reasoning)
Intelligence Index33.512.6
Coding Index30.19.2
Math Index11.3
Output speed (tok/s)304.994.3
Blended price ($/1M)$0.56$0.20
Time to first token (s)5.00s0.61s
aime
aime 2511.3%
artificial analysis coding index30.109.20
artificial analysis intelligence index33.5012.60
artificial analysis math index11.30
gpqa82.2%49.1%
hle16.2%3.6%
ifbench77.2%29.0%
lcr65.3%2.0%
livecodebench26.9%
math 500
mmlu pro66.4%
scicode41.9%27.7%
tau231.3%21.6%
terminalbench hard24.2%0.0%

Benchmark data from Artificial Analysis.