o4-mini (high) vs Qwen3 0.6B (Reasoning)

OpenAI vs Alibaba — side-by-side benchmark comparison

	o4-mini (high)	Qwen3 0.6B (Reasoning)
Intelligence Index	33.1	6.5
Coding Index	25.6	0.9
Math Index	90.7	18.0
Output speed (tok/s)	160.5	222.0
Blended price ($/1M tokens)	$1.93	$0.40
Time to first token (s)	23.07	1.07
aime	94.0%	10.0%
aime 25	90.7%	18.0%
artificial analysis coding index	25.60	0.90
artificial analysis intelligence index	33.10	6.50
artificial analysis math index	90.70	18.00
gpqa	78.4%	23.9%
hle	17.5%	5.7%
ifbench	68.7%	23.3%
lcr	55.0%	0.0%
livecodebench	85.9%	12.1%
math 500	98.9%	75.0%
mmlu pro	83.2%	34.7%
scicode	46.5%	2.8%
tau2	55.6%	21.1%
terminalbench hard	15.2%	0.0%
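
The speed, latency, and price rows are easier to compare once they are combined into a per-response estimate. The following is a minimal sketch, not Artificial Analysis's methodology: it assumes total response time is roughly time to first token plus output tokens divided by output speed, and that the blended price can be applied to a total token count. The function names and the 500-token response length are illustrative assumptions; the figures in `MODELS` are copied from the table above.

```python
# Figures copied from the comparison table; everything else is an assumption.
MODELS = {
    "o4-mini (high)": {"ttft_s": 23.07, "tok_per_s": 160.5, "price_per_1m": 1.93},
    "Qwen3 0.6B (Reasoning)": {"ttft_s": 1.07, "tok_per_s": 222.0, "price_per_1m": 0.40},
}


def response_time_s(ttft_s: float, tok_per_s: float, output_tokens: int) -> float:
    """Estimate end-to-end time: wait for the first token, then stream the rest.

    Assumes the two table metrics combine additively (a simplification).
    """
    return ttft_s + output_tokens / tok_per_s


def blended_cost_usd(price_per_1m: float, total_tokens: int) -> float:
    """Rough cost estimate: blended $/1M tokens applied to a total token count."""
    return price_per_1m * total_tokens / 1_000_000


if __name__ == "__main__":
    output_tokens = 500  # hypothetical response length
    for name, m in MODELS.items():
        t = response_time_s(m["ttft_s"], m["tok_per_s"], output_tokens)
        c = blended_cost_usd(m["price_per_1m"], output_tokens)
        print(f"{name}: ~{t:.1f} s, ~${c:.5f} for {output_tokens} tokens")
```

Under these assumptions, the time to first token dominates for short responses, so Qwen3 0.6B's latency advantage matters more than its higher output speed; the benchmark rows show what that speed costs in accuracy.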

Benchmark data from Artificial Analysis.