Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) vs Qwen3.6 27B (Reasoning)

NVIDIA vs Alibaba — side-by-side benchmark comparison

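| Metric | Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | Qwen3.6 27B (Reasoning) |
| --- | --- | --- |
| Intelligence Index | 15.0 | 45.8 |
| Coding Index | 13.1 | 36.5 |
| Output speed (tok/s) | 52.3 | 57.4 |
| Blended price ($/1M tokens) | $0.90 | $1.35 |
| Time to first token (s) | 0.76 | 1.46 |
| Artificial Analysis Coding Index | 13.10 | 36.50 |
| Artificial Analysis Intelligence Index | 15.00 | 45.80 |
| GPQA | 72.8% | 84.2% |
| HLE | 8.1% | 21.6% |
| IFBench | 38.2% | 67.6% |
| LCR | 7.3% | 68.7% |
| SciCode | 34.7% | 39.8% |
| tau2 | 11.4% | 94.2% |
| TerminalBench Hard | 2.3% | 34.8% |

The following metrics list a single value; the model it belongs to is not identified.

| Metric | Value |
| --- | --- |
| Math Index | 63.7 |
| AIME | 74.7% |
| AIME 25 | 63.7% |
| Artificial Analysis Math Index | 63.70 |
| LiveCodeBench | 64.1% |
| MATH-500 | 95.2% |
| MMLU-Pro | 82.5% |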

Benchmark data from Artificial Analysis.