Gemma 4 26B A4B (Non-reasoning) vs Qwen3.5 397B A17B (Reasoning)

Google vs Alibaba — side-by-side benchmark comparison

	Gemma 4 26B A4B (Non-reasoning)	Qwen3.5 397B A17B (Reasoning)
Intelligence Index	27.1	45.0
Coding Index	29.1	41.3
Math Index	—	—
Output speed (tok/s)	71.1	52.1
Blended price ($/1M)	$0.20	$1.35
Time to first token (s)	0.80s	1.81s
aime	—	—
aime 25	—	—
artificial analysis coding index	29.10	41.30
artificial analysis intelligence index	27.10	45.00
artificial analysis math index	—	—
gpqa	71.4%	89.3%
hle	10.7%	27.3%
ifbench	45.4%	78.8%
lcr	39.7%	65.7%
livecodebench	—	—
math 500	—	—
mmlu pro	—	—
scicode	37.3%	42.0%
tau2	40.4%	95.6%
terminalbench hard	25.0%	40.9%

Benchmark data from Artificial Analysis.