Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) vs DeepSeek R1 0528 (May '25)

NVIDIA vs DeepSeek — side-by-side benchmark comparison

Metric                                   | Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | DeepSeek R1 0528 (May '25)
Artificial Analysis Intelligence Index   | 15.0                                         | 27.1
Artificial Analysis Coding Index         | 13.1                                         | 24.0
Artificial Analysis Math Index           | 63.7                                         | 76.0
Output speed (tok/s)                     | 52.3                                         | 0.0
Blended price ($/1M tokens)              | $0.90                                        | $2.06
Time to first token (s)                  | 0.76                                         | 0.00
AIME                                     | 74.7%                                        | 89.3%
AIME 2025                                | 63.7%                                        | 76.0%
GPQA                                     | 72.8%                                        | 81.3%
HLE                                      | 8.1%                                         | 14.9%
IFBench                                  | 38.2%                                        | 39.6%
LCR                                      | 7.3%                                         | 54.7%
LiveCodeBench                            | 64.1%                                        | 77.0%
MATH-500                                 | 95.2%                                        | 98.3%
MMLU-Pro                                 | 82.5%                                        | 84.9%
SciCode                                  | 34.7%                                        | 40.3%
tau2                                     | 11.4%                                        | 36.5%
Terminal-Bench Hard                      | 2.3%                                         | 15.9%
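
To see where the two models differ most, the percentage scores listed above can be turned into per-benchmark gaps. The following is a minimal sketch, not part of the original comparison: the `scores` dictionary and the `gaps` helper are illustrative names, and the figures are simply copied from the benchmark rows.

```python
# Scores copied from the comparison above, in percent:
# (Llama 3.1 Nemotron Ultra 253B v1, DeepSeek R1 0528).
# The dictionary layout and helper name are illustrative assumptions.
scores = {
    "AIME": (74.7, 89.3),
    "AIME 2025": (63.7, 76.0),
    "GPQA": (72.8, 81.3),
    "HLE": (8.1, 14.9),
    "IFBench": (38.2, 39.6),
    "LCR": (7.3, 54.7),
    "LiveCodeBench": (64.1, 77.0),
    "MATH-500": (95.2, 98.3),
    "MMLU-Pro": (82.5, 84.9),
    "SciCode": (34.7, 40.3),
    "tau2": (11.4, 36.5),
    "Terminal-Bench Hard": (2.3, 15.9),
}


def gaps(table):
    """Return (benchmark, gap) pairs, largest gap first.

    The gap is the DeepSeek score minus the Nemotron score,
    in percentage points, rounded to one decimal place.
    """
    diffs = [(name, round(b - a, 1)) for name, (a, b) in table.items()]
    return sorted(diffs, key=lambda item: item[1], reverse=True)


ranked = gaps(scores)
for name, gap in ranked:
    print(f"{name:<20} {gap:+.1f} pp")
```

Sorting by gap rather than by raw score makes it easier to tell near-ties from benchmarks where one model is far ahead.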

Benchmark data from Artificial Analysis.