o1-mini vs Qwen3 VL 235B A22B (Reasoning)

OpenAI vs Alibaba — side-by-side benchmark comparison

Metric	o1-mini	Qwen3 VL 235B A22B (Reasoning)
Intelligence Index	20.4	27.6
Coding Index	—	20.9
Math Index	—	88.3
Output speed (tok/s)	—	35.6
Blended price ($/1M tokens)	—	$2.17
Time to first token (s)	—	5.14
AIME	60.3%	—
AIME 2025	—	88.3%
GPQA	60.3%	77.2%
Humanity's Last Exam (HLE)	4.9%	10.1%
IFBench	—	56.5%
AA-LCR	—	58.7%
LiveCodeBench	57.6%	64.6%
MATH-500	94.4%	—
MMLU-Pro	74.2%	83.6%
SciCode	32.3%	39.9%
τ²-Bench	—	54.1%
Terminal-Bench Hard	—	11.4%

Benchmark data from Artificial Analysis.
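
Only six rows report a score for both models, so a direct comparison is limited to those. The sketch below is one way to compute the gap on each shared row. The scores are copied from the table; the names `shared_scores` and `score_gaps` are illustrative and are not part of any Artificial Analysis tooling.

```python
# Scores copied from the table above, as
# (o1-mini, Qwen3 VL 235B A22B (Reasoning)).
# Only rows where both models report a value are included.
shared_scores = {
    "Intelligence Index": (20.4, 27.6),
    "GPQA": (60.3, 77.2),
    "HLE": (4.9, 10.1),
    "LiveCodeBench": (57.6, 64.6),
    "MMLU-Pro": (74.2, 83.6),
    "SciCode": (32.3, 39.9),
}


def score_gaps(scores):
    """Return {benchmark: Qwen3 score minus o1-mini score}, rounded to 0.1."""
    return {name: round(qwen - o1, 1) for name, (o1, qwen) in scores.items()}


gaps = score_gaps(shared_scores)

# Print the largest gap first.
for name, gap in sorted(gaps.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<20} {gap:+.1f}")
```

Gaps are in percentage points, except for the Intelligence Index, which is in index points. Rows reported by only one model (for example AIME versus AIME 2025) are left out because they are not the same test.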