← All comparisons

Qwen3.5 2B (Non-reasoning) vs GPT-4o (ChatGPT)

Alibaba vs OpenAI — side-by-side benchmark comparison

Qwen3.5 2B (Non-reasoning)GPT-4o (ChatGPT)
Intelligence Index14.714.1
Coding Index4.9
Math Index
Output speed (tok/s)272.00.0
Blended price ($/1M)$0.04$0.00
Time to first token (s)0.27s0.00s
aime10.3%
aime 25
artificial analysis coding index4.90
artificial analysis intelligence index14.7014.10
artificial analysis math index
gpqa43.8%51.1%
hle4.9%3.7%
ifbench29.1%
lcr13.7%53.0%
livecodebench
math 50079.7%
mmlu pro77.3%
scicode7.2%33.4%
tau281.6%
terminalbench hard3.8%

Benchmark data from Artificial Analysis.