Olmo 3.1 32B Instruct vs Qwen3 VL 32B (Reasoning)

Allen Institute for AI vs Alibaba — side-by-side benchmark comparison

Metric: Olmo 3.1 32B Instruct / Qwen3 VL 32B (Reasoning)
Intelligence Index: 12.2 / 24.7
Coding Index: 5.6 / 14.5
Math Index: 84.7
Output speed (tok/s): 0.0 / 96.3
Blended price ($/1M tokens): $0.00 / $2.63
Time to first token (s): 0.00 / 1.12
Benchmark: Olmo 3.1 32B Instruct / Qwen3 VL 32B (Reasoning)
AIME
AIME 25: 84.7%
Artificial Analysis Coding Index: 5.60 / 14.50
Artificial Analysis Intelligence Index: 12.20 / 24.70
Artificial Analysis Math Index: 84.70
GPQA: 53.9% / 73.3%
HLE: 4.9% / 9.6%
IFBench: 39.2% / 59.4%
LCR: 0.0% / 55.3%
LiveCodeBench: 73.8%
MATH 500
MMLU Pro: 81.8%
SciCode: 16.7% / 28.5%
Tau2: 21.3% / 45.6%
TerminalBench Hard: 0.0% / 7.6%

Benchmark data from Artificial Analysis.