← All comparisons

Qwen3.5 9B (Non-reasoning) vs Claude 4.1 Opus (Reasoning)

Alibaba vs Anthropic — side-by-side benchmark comparison

Qwen3.5 9B (Non-reasoning)Claude 4.1 Opus (Reasoning)
Intelligence Index27.342.0
Coding Index21.336.5
Math Index80.3
Output speed (tok/s)0.044.5
Blended price ($/1M)$0.00$32.81
Time to first token (s)0.00s8.55s
aime
aime 2580.3%
artificial analysis coding index21.3036.50
artificial analysis intelligence index27.3042.00
artificial analysis math index80.30
gpqa78.6%80.9%
hle8.6%11.9%
ifbench37.8%55.4%
lcr38.0%66.3%
livecodebench65.4%
math 500
mmlu pro88.0%
scicode27.7%40.9%
tau285.1%71.4%
terminalbench hard18.2%34.3%

Benchmark data from Artificial Analysis.