Grok 4.20 0309 (Non-reasoning) vs Qwen3 VL 4B Instruct

xAI vs Alibaba — side-by-side benchmark comparison

Metric: Grok 4.20 0309 (Non-reasoning) / Qwen3 VL 4B Instruct
Intelligence Index: 29.7 / 9.6
Coding Index: 25.4 / 4.6
Math Index: 37.0
Output speed (tok/s): 202.6 / 0.0
Blended price ($/1M tokens): $3.00 / $0.00
Time to first token (s): 0.50 / 0.00
AIME
AIME 25: 37.0%
Artificial Analysis Coding Index: 25.40 / 4.60
Artificial Analysis Intelligence Index: 29.70 / 9.60
Artificial Analysis Math Index: 37.00
GPQA: 78.5% / 37.1%
HLE: 22.5% / 3.7%
IFBench: 47.8% / 31.8%
LCR: 18.0% / 13.0%
LiveCodeBench: 29.0%
MATH 500
MMLU Pro: 63.4%
SciCode: 32.2% / 13.7%
tau2: 69.6% / 23.4%
TerminalBench Hard: 22.0% / 0.0%

Benchmark data from Artificial Analysis.