Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) vs Qwen3.5 Omni Flash

NVIDIA vs Alibaba — side-by-side benchmark comparison

| Metric | Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | Qwen3.5 Omni Flash |
| --- | --- | --- |
| Intelligence Index | 15.0 | 25.9 |
| Coding Index | 13.1 | 14.0 |
| Math Index | 63.7 | - |
| Output speed (tok/s) | 52.3 | 260.3 |
| Blended price ($/1M) | $0.90 | $0.28 |
| Time to first token (s) | 0.76 | 1.07 |
| AIME | 74.7% | - |
| AIME 25 | 63.7% | - |
| Artificial Analysis Coding Index | 13.10 | 14.00 |
| Artificial Analysis Intelligence Index | 15.00 | 25.90 |
| Artificial Analysis Math Index | 63.70 | - |
| GPQA | 72.8% | 74.2% |
| HLE | 8.1% | 7.1% |
| IFBench | 38.2% | 38.0% |
| LCR | 7.3% | 44.0% |
| LiveCodeBench | 64.1% | - |
| MATH 500 | 95.2% | - |
| MMLU Pro | 82.5% | - |
| SciCode | 34.7% | 25.5% |
| tau2 | 11.4% | 84.5% |
| TerminalBench Hard | 2.3% | 8.3% |
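
To put the headline figures above in relative terms, here is a minimal Python sketch that reports which model leads on each metric and by what factor. It reads the run-together speed figures as 52.3 and 260.3 tok/s. The `compare` helper, the `HEADLINE` dictionary, and the choice of which metrics count as "lower is better" are illustrative assumptions, not part of the Artificial Analysis data or methodology.

```python
# Headline figures from the comparison, as (Nemotron Ultra, Qwen3.5 Omni Flash).
HEADLINE = {
    "Intelligence Index": (15.0, 25.9),
    "Coding Index": (13.1, 14.0),
    "Output speed (tok/s)": (52.3, 260.3),
    "Blended price ($/1M)": (0.90, 0.28),
    "Time to first token (s)": (0.76, 1.07),
}

# Assumption: for these metrics a smaller value is the better result.
LOWER_IS_BETTER = {"Blended price ($/1M)", "Time to first token (s)"}

MODEL_A = "Nemotron Ultra"
MODEL_B = "Qwen3.5 Omni Flash"


def compare(metric, a, b):
    """Return (leader, ratio), where ratio is larger value / smaller value (>= 1)."""
    if a == b:
        return ("tie", 1.0)
    a_wins = (a < b) if metric in LOWER_IS_BETTER else (a > b)
    ratio = max(a, b) / min(a, b)
    return (MODEL_A if a_wins else MODEL_B, ratio)


for metric, (a, b) in HEADLINE.items():
    leader, ratio = compare(metric, a, b)
    print(f"{metric}: {leader} leads by a factor of {ratio:.2f}")
```

A ratio is used rather than a difference so that metrics with different units (tok/s, dollars, seconds) can be read on the same scale. It says nothing about whether a gap is meaningful for a given workload.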

Benchmark data from Artificial Analysis.