Qwen3.5 0.8B (Reasoning) vs Grok 4.20 0309 v2 (Non-reasoning)

Alibaba vs xAI — side-by-side benchmark comparison

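| Metric | Qwen3.5 0.8B (Reasoning) | Grok 4.20 0309 v2 (Non-reasoning) |
| --- | --- | --- |
| Intelligence Index | 10.5 | 29.0 |
| Coding Index | 0.0 | 22.0 |
| Math Index | | |
| Output speed (tok/s) | 0.0 | 175.2 |
| Blended price ($/1M) | $0.02 | $3.00 |
| Time to first token (s) | 0.00s | 0.47s |

| Benchmark | Qwen3.5 0.8B (Reasoning) | Grok 4.20 0309 v2 (Non-reasoning) |
| --- | --- | --- |
| AIME | | |
| AIME 25 | | |
| Artificial Analysis Coding Index | 0.0% | 22.00 |
| Artificial Analysis Intelligence Index | 10.50 | 29.00 |
| Artificial Analysis Math Index | | |
| GPQA | 11.1% | 77.6% |
| HLE | 1.2% | 24.2% |
| IFBench | 21.5% | 49.3% |
| LCR | 5.3% | 17.3% |
| LiveCodeBench | | |
| MATH 500 | | |
| MMLU Pro | | |
| SciCode | 0.0% | 32.8% |
| tau2 | 47.7% | 59.9% |
| TerminalBench Hard | 0.0% | 16.7% |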

Benchmark data from Artificial Analysis.