← All comparisons

Grok 4.20 0309 (Non-reasoning) vs Qwen3 VL 8B Instruct

xAI vs Alibaba — side-by-side benchmark comparison

Grok 4.20 0309 (Non-reasoning)Qwen3 VL 8B Instruct
Intelligence Index29.714.3
Coding Index25.47.3
Math Index27.3
Output speed (tok/s)202.6143.8
Blended price ($/1M)$3.00$0.31
Time to first token (s)0.50s0.93s
aime
aime 2527.3%
artificial analysis coding index25.407.30
artificial analysis intelligence index29.7014.30
artificial analysis math index27.30
gpqa78.5%42.7%
hle22.5%2.9%
ifbench47.8%32.3%
lcr18.0%15.3%
livecodebench33.2%
math 500
mmlu pro68.6%
scicode32.2%17.4%
tau269.6%29.2%
terminalbench hard22.0%2.3%

Benchmark data from Artificial Analysis.