← All comparisons

Claude 2.0 vs Qwen3 235B A22B (Reasoning)

Anthropic vs Alibaba — side-by-side benchmark comparison

Claude 2.0Qwen3 235B A22B (Reasoning)
Intelligence Index9.119.8
Coding Index12.917.4
Math Index82.0
Output speed (tok/s)0.058.3
Blended price ($/1M)$0.00$2.63
Time to first token (s)0.00s1.37s
aime0.0%84.0%
aime 2582.0%
artificial analysis coding index12.9017.40
artificial analysis intelligence index9.1019.80
artificial analysis math index82.00
gpqa34.4%70.0%
hle11.7%
ifbench38.7%
lcr0.0%
livecodebench17.1%62.2%
math 50093.0%
mmlu pro48.6%82.8%
scicode19.4%39.9%
tau224.0%
terminalbench hard6.1%

Benchmark data from Artificial Analysis.