← All comparisons

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) vs Qwen3 VL 32B (Reasoning)

NVIDIA vs Alibaba — side-by-side benchmark comparison

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)Qwen3 VL 32B (Reasoning)
Intelligence Index15.024.7
Coding Index13.114.5
Math Index63.784.7
Output speed (tok/s)52.396.3
Blended price ($/1M)$0.90$2.63
Time to first token (s)0.76s1.12s
aime74.7%
aime 2563.7%84.7%
artificial analysis coding index13.1014.50
artificial analysis intelligence index15.0024.70
artificial analysis math index63.7084.70
gpqa72.8%73.3%
hle8.1%9.6%
ifbench38.2%59.4%
lcr7.3%55.3%
livecodebench64.1%73.8%
math 50095.2%
mmlu pro82.5%81.8%
scicode34.7%28.5%
tau211.4%45.6%
terminalbench hard2.3%7.6%

Benchmark data from Artificial Analysis.