← All comparisons

Claude Sonnet 4.6 (Non-reasoning, Low Effort) vs Qwen3 VL 30B A3B (Reasoning)

Anthropic vs Alibaba — side-by-side benchmark comparison

Claude Sonnet 4.6 (Non-reasoning, Low Effort)Qwen3 VL 30B A3B (Reasoning)
Intelligence Index42.619.7
Coding Index43.013.1
Math Index82.3
Output speed (tok/s)54.9123.6
Blended price ($/1M)$6.56$0.34
Time to first token (s)1.13s1.09s
aime
aime 2582.3%
artificial analysis coding index43.0013.10
artificial analysis intelligence index42.6019.70
artificial analysis math index82.30
gpqa79.7%72.0%
hle10.8%8.7%
ifbench42.4%45.1%
lcr58.7%40.7%
livecodebench69.7%
math 500
mmlu pro80.7%
scicode44.1%28.8%
tau278.9%19.9%
terminalbench hard42.4%5.3%

Benchmark data from Artificial Analysis.