Granite 4.1 30B vs Qwen3 235B A22B 2507 Instruct

IBM vs Alibaba — side-by-side benchmark comparison

Metric	Granite 4.1 30B	Qwen3 235B A22B 2507 Instruct
Intelligence Index	14.7	25.0
Coding Index	10.1	22.1
Math Index	—	71.7
Output speed (tok/s)	—	57.0
Blended price ($/1M tokens)	—	$0.36
Time to first token (s)	—	1.34
AIME	—	71.7%
AIME 25	—	71.7%
GPQA	48.1%	75.3%
HLE	4.2%	10.6%
IFBench	44.4%	46.1%
LCR	18.7%	31.2%
LiveCodeBench	—	52.4%
MATH-500	—	98.0%
MMLU-Pro	—	82.8%
SciCode	25.8%	36.0%
τ²-Bench	42.1%	33.3%
Terminal-Bench Hard	2.3%	15.2%

Benchmark data from Artificial Analysis.
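
The per-benchmark gaps can be read off the table with simple subtraction. The following is a minimal Python sketch, not part of the source data: it copies the seven rows where both models report a score, keyed by lowercased benchmark label, and computes the Qwen3 minus Granite difference in percentage points. The names `SCORES`, `gaps`, and `granite_leads` are illustrative only.

```python
# Scores in percent, copied from the table above as (Granite 4.1 30B, Qwen3 235B A22B 2507 Instruct).
# Only benchmarks where both models report a value are included.
SCORES = {
    "gpqa": (48.1, 75.3),
    "hle": (4.2, 10.6),
    "ifbench": (44.4, 46.1),
    "lcr": (18.7, 31.2),
    "scicode": (25.8, 36.0),
    "tau2": (42.1, 33.3),
    "terminalbench hard": (2.3, 15.2),
}


def gaps(scores):
    """Return {benchmark: Qwen3 score - Granite score} in percentage points."""
    return {name: round(qwen - granite, 1) for name, (granite, qwen) in scores.items()}


def granite_leads(scores):
    """Return the benchmarks on which Granite scores higher than Qwen3."""
    return [name for name, (granite, qwen) in scores.items() if granite > qwen]


if __name__ == "__main__":
    # Print gaps from largest Qwen3 advantage to largest Granite advantage.
    for name, gap in sorted(gaps(SCORES).items(), key=lambda kv: kv[1], reverse=True):
        print(f"{name:20s} {gap:+.1f} pp")
    print("Granite leads on:", ", ".join(granite_leads(SCORES)))
```

On these figures the largest gap is on gpqa (27.2 points in favor of Qwen3), and tau2 is the only shared benchmark where Granite scores higher (by 8.8 points).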