Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs Qwen3 VL 30B A3B (Reasoning)

Anthropic vs Alibaba — side-by-side benchmark comparison

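| Metric | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Qwen3 VL 30B A3B (Reasoning) |
| --- | --- | --- |
| Intelligence Index | 51.7 | 19.7 |
| Coding Index | 50.9 | 13.1 |
| Output speed (tok/s) | 68.2 | 123.6 |
| Blended price ($/1M tokens) | $6.56 | $0.34 |
| Time to first token (s) | 55.35 | 1.09 |

Math Index: 82.3 (single value shown; the model it belongs to is not indicated)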
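| Benchmark | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Qwen3 VL 30B A3B (Reasoning) |
| --- | --- | --- |
| Artificial Analysis Coding Index | 50.90 | 13.10 |
| Artificial Analysis Intelligence Index | 51.70 | 19.70 |
| GPQA | 87.5% | 72.0% |
| HLE | 30.0% | 8.7% |
| IFBench | 56.6% | 45.1% |
| LCR | 70.7% | 40.7% |
| SciCode | 46.8% | 28.8% |
| Tau2 | 75.7% | 19.9% |
| TerminalBench Hard | 53.0% | 5.3% |

Single value shown (the model it belongs to is not indicated):
AIME 25: 82.3%
Artificial Analysis Math Index: 82.30
LiveCodeBench: 69.7%
MMLU Pro: 80.7%

No value shown: AIME, MATH 500.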

Benchmark data from Artificial Analysis.