GPT-5.1 (Non-reasoning) vs Qwen3.5 27B (Non-reasoning)

OpenAI vs Alibaba — side-by-side benchmark comparison

	GPT-5.1 (Non-reasoning)	Qwen3.5 27B (Non-reasoning)
Intelligence Index	27.4	37.2
Coding Index	27.3	33.4
Math Index	38.0	—
Output speed (tok/s)	129.5	95.3
Blended price ($/1M)	$3.44	$0.88
Time to first token (s)	0.72s	1.40s
aime	—	—
aime 25	38.0%	—
artificial analysis coding index	27.30	33.40
artificial analysis intelligence index	27.40	37.20
artificial analysis math index	38.00	—
gpqa	64.3%	84.2%
hle	5.2%	13.2%
ifbench	43.2%	46.9%
lcr	44.0%	55.7%
livecodebench	49.4%	—
math 500	—	—
mmlu pro	80.1%	—
scicode	36.5%	36.7%
tau2	46.5%	87.1%
terminalbench hard	22.7%	31.8%

Benchmark data from Artificial Analysis.