← All comparisons

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) vs Qwen3 VL 30B A3B Instruct

NVIDIA vs Alibaba — side-by-side benchmark comparison

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)Qwen3 VL 30B A3B Instruct
Intelligence Index15.016.0
Coding Index13.114.3
Math Index63.772.3
Output speed (tok/s)52.3123.5
Blended price ($/1M)$0.90$0.30
Time to first token (s)0.76s1.07s
aime74.7%
aime 2563.7%72.3%
artificial analysis coding index13.1014.30
artificial analysis intelligence index15.0016.00
artificial analysis math index63.7072.30
gpqa72.8%69.5%
hle8.1%6.4%
ifbench38.2%33.1%
lcr7.3%23.7%
livecodebench64.1%47.6%
math 50095.2%
mmlu pro82.5%76.4%
scicode34.7%30.8%
tau211.4%19.0%
terminalbench hard2.3%6.1%

Benchmark data from Artificial Analysis.