← All comparisons

Command-R (Mar '24) vs Qwen3 4B 2507 (Reasoning)

Cohere vs Alibaba — side-by-side benchmark comparison

Command-R (Mar '24)Qwen3 4B 2507 (Reasoning)
Intelligence Index7.418.2
Coding Index9.5
Math Index82.7
Output speed (tok/s)0.00.0
Blended price ($/1M)$0.75$0.00
Time to first token (s)0.00s0.00s
aime0.7%
aime 2582.7%
artificial analysis coding index9.50
artificial analysis intelligence index7.4018.20
artificial analysis math index82.70
gpqa28.4%66.7%
hle4.8%5.9%
ifbench49.8%
lcr37.7%
livecodebench4.8%64.1%
math 50016.4%
mmlu pro33.8%74.3%
scicode6.3%25.6%
tau225.4%
terminalbench hard1.5%

Benchmark data from Artificial Analysis.