Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs Qwen Chat 72B

Anthropic vs Alibaba — side-by-side benchmark comparison

	Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)	Qwen Chat 72B
Intelligence Index	51.7	8.8
Coding Index	50.9	—
Math Index	—	—
Output speed (tok/s)	68.2	0.0
Blended price ($/1M)	$6.56	$0.00
Time to first token (s)	55.35s	0.00s
aime	—	—
aime 25	—	—
artificial analysis coding index	50.90	—
artificial analysis intelligence index	51.70	8.80
artificial analysis math index	—	—
gpqa	87.5%	—
hle	30.0%	—
ifbench	56.6%	—
lcr	70.7%	—
livecodebench	—	—
math 500	—	—
mmlu pro	—	—
scicode	46.8%	—
tau2	75.7%	—
terminalbench hard	53.0%	—

Benchmark data from Artificial Analysis.