← All comparisons

Qwen3 235B A22B 2507 Instruct vs Qwen3 4B 2507 (Reasoning)

Alibaba vs Alibaba — side-by-side benchmark comparison

Qwen3 235B A22B 2507 InstructQwen3 4B 2507 (Reasoning)
Intelligence Index25.018.2
Coding Index22.19.5
Math Index71.782.7
Output speed (tok/s)57.00.0
Blended price ($/1M)$0.36$0.00
Time to first token (s)1.34s0.00s
aime71.7%
aime 2571.7%82.7%
artificial analysis coding index22.109.50
artificial analysis intelligence index25.0018.20
artificial analysis math index71.7082.70
gpqa75.3%66.7%
hle10.6%5.9%
ifbench46.1%49.8%
lcr31.2%37.7%
livecodebench52.4%64.1%
math 50098.0%
mmlu pro82.8%74.3%
scicode36.0%25.6%
tau233.3%25.4%
terminalbench hard15.2%1.5%

Benchmark data from Artificial Analysis.