Qwen3.5 0.8B (Non-reasoning) vs Grok 4.20 0309 (Reasoning)

Alibaba vs xAI: side-by-side benchmark comparison

Metric                                   Qwen3.5 0.8B (Non-reasoning)   Grok 4.20 0309 (Reasoning)
Intelligence Index                       9.9                            48.5
Coding Index                             1.0                            42.2
Math Index                               -                              -
Output speed (tok/s)                     96.3                           217.8
Blended price ($/1M tokens)              $0.02                          $3.00
Time to first token (s)                  0.26                           13.18
aime                                     -                              -
aime 25                                  -                              -
artificial analysis coding index         1.00                           42.20
artificial analysis intelligence index   9.90                           48.50
artificial analysis math index           -                              -
gpqa                                     23.6%                          88.5%
hle                                      4.9%                           30.0%
ifbench                                  21.6%                          82.9%
lcr                                      6.7%                           59.0%
livecodebench                            -                              -
math 500                                 -                              -
mmlu pro                                 -                              -
scicode                                  2.9%                           44.7%
tau2                                     65.2%                          96.5%
terminalbench hard                       0.0%                           40.9%

Benchmark data from Artificial Analysis.