← All comparisons

Claude Opus 4.7 (Non-reasoning, High Effort) vs Qwen3 4B 2507 Instruct

Anthropic vs Alibaba — side-by-side benchmark comparison

Claude Opus 4.7 (Non-reasoning, High Effort)Qwen3 4B 2507 Instruct
Intelligence Index51.812.9
Coding Index53.19.0
Math Index52.3
Output speed (tok/s)47.80.0
Blended price ($/1M)$10.94$0.00
Time to first token (s)1.04s0.00s
aime
aime 2552.3%
artificial analysis coding index53.109.00
artificial analysis intelligence index51.8012.90
artificial analysis math index52.30
gpqa88.5%51.7%
hle31.2%4.7%
ifbench43.6%33.5%
lcr67.0%7.3%
livecodebench37.7%
math 500
mmlu pro67.2%
scicode50.1%18.1%
tau274.0%26.6%
terminalbench hard54.5%4.5%

Benchmark data from Artificial Analysis.