Qwen3.5 397B A17B (Non-reasoning) vs Claude Opus 4.6 (Adaptive Reasoning, Max Effort)

Alibaba vs Anthropic — side-by-side benchmark comparison

Metric                                    Qwen3.5 397B A17B (Non-reasoning)    Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
Intelligence Index                        40.1                                 52.9
Coding Index                              37.4                                 48.1
Math Index                                -                                    -
Output speed (tok/s)                      53.5                                 54.8
Blended price ($/1M tokens)               $1.35                                $10.94
Time to first token (s)                   1.85                                 11.69

Benchmark                                 Qwen3.5 397B A17B (Non-reasoning)    Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
AIME                                      -                                    -
AIME 25                                   -                                    -
Artificial Analysis Coding Index          37.40                                48.10
Artificial Analysis Intelligence Index    40.10                                52.90
Artificial Analysis Math Index            -                                    -
GPQA                                      86.1%                                89.6%
HLE                                       18.8%                                36.7%
IFBench                                   51.6%                                53.1%
LCR                                       58.0%                                70.7%
LiveCodeBench                             -                                    -
MATH 500                                  -                                    -
MMLU Pro                                  -                                    -
SciCode                                   41.1%                                51.9%
tau2                                      83.9%                                92.1%
TerminalBench Hard                        35.6%                                46.2%

Benchmark data from Artificial Analysis.