← All comparisons

Qwen3.5 4B (Non-reasoning) vs DeepSeek V3.1 Terminus (Reasoning)

Alibaba vs DeepSeek — side-by-side benchmark comparison

Qwen3.5 4B (Non-reasoning)DeepSeek V3.1 Terminus (Reasoning)
Intelligence Index22.633.9
Coding Index13.733.7
Math Index89.7
Output speed (tok/s)210.00.0
Blended price ($/1M)$0.06$1.91
Time to first token (s)0.23s0.00s
aime
aime 2589.7%
artificial analysis coding index13.7033.70
artificial analysis intelligence index22.6033.90
artificial analysis math index89.70
gpqa71.2%79.2%
hle7.5%15.2%
ifbench33.3%57.0%
lcr28.3%65.0%
livecodebench79.8%
math 500
mmlu pro85.1%
scicode18.3%40.6%
tau287.7%37.1%
terminalbench hard11.4%30.3%

Benchmark data from Artificial Analysis.