← All comparisons

GPT-5.4 nano (Non-Reasoning) vs Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

OpenAI vs NVIDIA — side-by-side benchmark comparison

GPT-5.4 nano (Non-Reasoning)Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
Intelligence Index24.415.0
Coding Index27.913.1
Math Index63.7
Output speed (tok/s)157.452.3
Blended price ($/1M)$0.46$0.90
Time to first token (s)0.54s0.76s
aime74.7%
aime 2563.7%
artificial analysis coding index27.9013.10
artificial analysis intelligence index24.4015.00
artificial analysis math index63.70
gpqa55.8%72.8%
hle4.2%8.1%
ifbench32.7%38.2%
lcr24.7%7.3%
livecodebench64.1%
math 50095.2%
mmlu pro82.5%
scicode35.2%34.7%
tau234.8%11.4%
terminalbench hard24.2%2.3%

Benchmark data from Artificial Analysis.