Gemma 4 31B (Non-reasoning) vs Molmo2-8B

Google vs Allen Institute for AI — side-by-side benchmark comparison

	Gemma 4 31B (Non-reasoning)	Molmo2-8B
Intelligence Index	32.3	7.3
Coding Index	33.9	4.4
Math Index	—	—
Output speed (tok/s)	18.0	0.0
Blended price ($/1M)	$0.20	$0.00
Time to first token (s)	0.60s	0.00s
aime	—	—
aime 25	—	—
artificial analysis coding index	33.90	4.40
artificial analysis intelligence index	32.30	7.30
artificial analysis math index	—	—
gpqa	76.3%	42.5%
hle	11.5%	4.4%
ifbench	53.5%	26.9%
lcr	36.0%	0.0%
livecodebench	—	—
math 500	—	—
mmlu pro	—	—
scicode	41.1%	13.3%
tau2	65.5%	0.0%
terminalbench hard	30.3%	0.0%

Benchmark data from Artificial Analysis.