← All comparisons

Qwen3.5 2B (Non-reasoning) vs o4-mini (high)

Alibaba vs OpenAI — side-by-side benchmark comparison

Qwen3.5 2B (Non-reasoning)o4-mini (high)
Intelligence Index14.733.1
Coding Index4.925.6
Math Index90.7
Output speed (tok/s)272.0160.5
Blended price ($/1M)$0.04$1.93
Time to first token (s)0.27s23.07s
aime94.0%
aime 2590.7%
artificial analysis coding index4.9025.60
artificial analysis intelligence index14.7033.10
artificial analysis math index90.70
gpqa43.8%78.4%
hle4.9%17.5%
ifbench29.1%68.7%
lcr13.7%55.0%
livecodebench85.9%
math 50098.9%
mmlu pro83.2%
scicode7.2%46.5%
tau281.6%55.6%
terminalbench hard3.8%15.2%

Benchmark data from Artificial Analysis.