GraySoft
Model Intelligence Sheet

DevQuasar/nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-GGUF overview


Tags: gguf, text-generation, base_model:nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16, base_model:quantized:nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16, endpoints_compatible, region:us, imatrix, conversational

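Local inference with llama.cpp / guIDE requires a complete quantization set. The smallest one listed here, IQ1_M, totals about 117 GB on disk across 9 shards. The ~4.57 GB figure is the size of the final IQ1_M shard alone, not of a runnable model, so the weights cannot be held entirely in the memory of an 8 GB VRAM class GPU.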

Downloads: 176
Likes: 0
Pipeline: text-generation
Author: DevQuasar

Repository Files & Downloads

120 GGUF files detected
Direct downloads for local inference
| File | Type | Quantization | Size |
| --- | --- | --- | --- |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00001-of-00009.gguf | GGUF | IQ1_M | 14.37 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00002-of-00009.gguf | GGUF | IQ1_M | 14.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00003-of-00009.gguf | GGUF | IQ1_M | 14.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00004-of-00009.gguf | GGUF | IQ1_M | 14.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00005-of-00009.gguf | GGUF | IQ1_M | 14.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00006-of-00009.gguf | GGUF | IQ1_M | 14.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00007-of-00009.gguf | GGUF | IQ1_M | 14.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00008-of-00009.gguf | GGUF | IQ1_M | 14.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ1_M-00009-of-00009.gguf | GGUF | IQ1_M | 4.57 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00001-of-00010.gguf | GGUF | IQ2_XXS | 13.56 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00002-of-00010.gguf | GGUF | IQ2_XXS | 14.82 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00003-of-00010.gguf | GGUF | IQ2_XXS | 13.79 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00004-of-00010.gguf | GGUF | IQ2_XXS | 13.75 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00005-of-00010.gguf | GGUF | IQ2_XXS | 13.75 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00006-of-00010.gguf | GGUF | IQ2_XXS | 13.79 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00007-of-00010.gguf | GGUF | IQ2_XXS | 13.75 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00008-of-00010.gguf | GGUF | IQ2_XXS | 13.79 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00009-of-00010.gguf | GGUF | IQ2_XXS | 13.75 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.IQ2_XXS-00010-of-00010.gguf | GGUF | IQ2_XXS | 10.87 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00001-of-00014.gguf | GGUF | Q2_K | 13.34 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00002-of-00014.gguf | GGUF | Q2_K | 14.27 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00003-of-00014.gguf | GGUF | Q2_K | 13.85 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00004-of-00014.gguf | GGUF | Q2_K | 14.22 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00005-of-00014.gguf | GGUF | Q2_K | 13.90 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00006-of-00014.gguf | GGUF | Q2_K | 14.22 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00007-of-00014.gguf | GGUF | Q2_K | 13.85 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00008-of-00014.gguf | GGUF | Q2_K | 14.27 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00009-of-00014.gguf | GGUF | Q2_K | 13.85 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00010-of-00014.gguf | GGUF | Q2_K | 14.27 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00011-of-00014.gguf | GGUF | Q2_K | 13.85 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00012-of-00014.gguf | GGUF | Q2_K | 14.27 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00013-of-00014.gguf | GGUF | Q2_K | 13.85 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q2_K-00014-of-00014.gguf | GGUF | Q2_K | 11.88 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00001-of-00020.gguf | GGUF | Q3_K_M | 13.17 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00002-of-00020.gguf | GGUF | Q3_K_M | 14.06 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00003-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00004-of-00020.gguf | GGUF | Q3_K_M | 13.36 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00005-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00006-of-00020.gguf | GGUF | Q3_K_M | 13.42 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00007-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00008-of-00020.gguf | GGUF | Q3_K_M | 13.36 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00009-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00010-of-00020.gguf | GGUF | Q3_K_M | 13.36 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00011-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00012-of-00020.gguf | GGUF | Q3_K_M | 13.42 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00013-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00014-of-00020.gguf | GGUF | Q3_K_M | 13.36 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00015-of-00020.gguf | GGUF | Q3_K_M | 12.89 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00016-of-00020.gguf | GGUF | Q3_K_M | 13.42 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00017-of-00020.gguf | GGUF | Q3_K_M | 12.82 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00018-of-00020.gguf | GGUF | Q3_K_M | 13.42 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00019-of-00020.gguf | GGUF | Q3_K_M | 12.82 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q3_K_M-00020-of-00020.gguf | GGUF | Q3_K_M | 5.07 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00001-of-00024.gguf | GGUF | Q4_K_M | 13.09 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00002-of-00024.gguf | GGUF | Q4_K_M | 14.63 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00003-of-00024.gguf | GGUF | Q4_K_M | 14.55 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00004-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00005-of-00024.gguf | GGUF | Q4_K_M | 13.24 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00006-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00007-of-00024.gguf | GGUF | Q4_K_M | 13.24 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00008-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00009-of-00024.gguf | GGUF | Q4_K_M | 14.86 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00010-of-00024.gguf | GGUF | Q4_K_M | 13.46 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00011-of-00024.gguf | GGUF | Q4_K_M | 12.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00012-of-00024.gguf | GGUF | Q4_K_M | 13.24 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00013-of-00024.gguf | GGUF | Q4_K_M | 12.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00014-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00015-of-00024.gguf | GGUF | Q4_K_M | 11.93 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00016-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00017-of-00024.gguf | GGUF | Q4_K_M | 11.93 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00018-of-00024.gguf | GGUF | Q4_K_M | 13.32 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00019-of-00024.gguf | GGUF | Q4_K_M | 11.93 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00020-of-00024.gguf | GGUF | Q4_K_M | 14.63 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00021-of-00024.gguf | GGUF | Q4_K_M | 13.24 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00022-of-00024.gguf | GGUF | Q4_K_M | 14.63 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00023-of-00024.gguf | GGUF | Q4_K_M | 14.55 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q4_K_M-00024-of-00024.gguf | GGUF | Q4_K_M | 14.33 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00001-of-00030.gguf | GGUF | Q5_K_M | 13.95 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00002-of-00030.gguf | GGUF | Q5_K_M | 11.81 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00003-of-00030.gguf | GGUF | Q5_K_M | 12.17 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00004-of-00030.gguf | GGUF | Q5_K_M | 11.81 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00005-of-00030.gguf | GGUF | Q5_K_M | 11.49 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00006-of-00030.gguf | GGUF | Q5_K_M | 11.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00007-of-00030.gguf | GGUF | Q5_K_M | 12.26 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00008-of-00030.gguf | GGUF | Q5_K_M | 14.58 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00009-of-00030.gguf | GGUF | Q5_K_M | 11.81 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00010-of-00030.gguf | GGUF | Q5_K_M | 11.49 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00011-of-00030.gguf | GGUF | Q5_K_M | 14.67 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00012-of-00030.gguf | GGUF | Q5_K_M | 11.73 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00013-of-00030.gguf | GGUF | Q5_K_M | 14.67 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00014-of-00030.gguf | GGUF | Q5_K_M | 11.49 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00015-of-00030.gguf | GGUF | Q5_K_M | 14.58 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00016-of-00030.gguf | GGUF | Q5_K_M | 11.14 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00017-of-00030.gguf | GGUF | Q5_K_M | 11.49 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00018-of-00030.gguf | GGUF | Q5_K_M | 14.67 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00019-of-00030.gguf | GGUF | Q5_K_M | 11.73 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00020-of-00030.gguf | GGUF | Q5_K_M | 14.67 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00021-of-00030.gguf | GGUF | Q5_K_M | 11.49 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00022-of-00030.gguf | GGUF | Q5_K_M | 14.67 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00023-of-00030.gguf | GGUF | Q5_K_M | 11.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00024-of-00030.gguf | GGUF | Q5_K_M | 12.26 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00025-of-00030.gguf | GGUF | Q5_K_M | 11.05 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00026-of-00030.gguf | GGUF | Q5_K_M | 12.17 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00027-of-00030.gguf | GGUF | Q5_K_M | 11.81 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00028-of-00030.gguf | GGUF | Q5_K_M | 12.17 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00029-of-00030.gguf | GGUF | Q5_K_M | 11.73 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q5_K_M-00030-of-00030.gguf | GGUF | Q5_K_M | 7.69 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00001-of-00032.gguf | GGUF | Q6_K | 14.87 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00002-of-00032.gguf | GGUF | Q6_K | 13.29 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00003-of-00032.gguf | GGUF | Q6_K | 12.90 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00004-of-00032.gguf | GGUF | Q6_K | 13.29 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00005-of-00032.gguf | GGUF | Q6_K | 12.90 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00006-of-00032.gguf | GGUF | Q6_K | 13.18 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00007-of-00032.gguf | GGUF | Q6_K | 13.01 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00008-of-00032.gguf | GGUF | Q6_K | 13.18 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00009-of-00032.gguf | GGUF | Q6_K | 12.90 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00010-of-00032.gguf | GGUF | Q6_K | 13.29 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00011-of-00032.gguf | GGUF | Q6_K | 12.90 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00012-of-00032.gguf | GGUF | Q6_K | 13.29 GB |
| nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16.Q6_K-00013-of-00032.gguf | GGUF | Q6_K | 12.90 GB |
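
The disk footprint of one quantization is the sum of its shard sizes in the listing above. A minimal Python sketch of that arithmetic follows, with sizes copied from the IQ1_M and IQ2_XXS rows. The helper names (`build_rows`, `parse_shard`, `footprint_gb`) are illustrative and not part of llama.cpp, guIDE, or any Hugging Face library.

```python
import re
from collections import defaultdict

# Matches the shard suffix used by the filenames in this listing,
# e.g. "...BF16.IQ1_M-00001-of-00009.gguf".
SHARD_RE = re.compile(
    r"\.(?P<quant>[A-Z0-9_]+)-(?P<index>\d{5})-of-(?P<total>\d{5})\.gguf$"
)

PREFIX = "nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16"

# Shard sizes in GB, in shard order, copied from the file listing.
SIZES_GB = {
    "IQ1_M": [14.37, 14.01, 14.01, 14.05, 14.05, 14.01, 14.05, 14.01, 4.57],
    "IQ2_XXS": [13.56, 14.82, 13.79, 13.75, 13.75,
                13.79, 13.75, 13.79, 13.75, 10.87],
}


def build_rows(sizes_by_quant):
    """Rebuild (filename, size_gb) rows in the listing's naming scheme."""
    rows = []
    for quant, sizes in sizes_by_quant.items():
        for i, size in enumerate(sizes, start=1):
            name = f"{PREFIX}.{quant}-{i:05d}-of-{len(sizes):05d}.gguf"
            rows.append((name, size))
    return rows


def parse_shard(filename):
    """Return (quantization, shard index, shard count) from a shard filename."""
    match = SHARD_RE.search(filename)
    if match is None:
        raise ValueError(f"not a split GGUF shard name: {filename}")
    return match["quant"], int(match["index"]), int(match["total"])


def footprint_gb(rows):
    """Sum shard sizes per quantization, rounded to two decimals."""
    totals = defaultdict(float)
    for filename, size in rows:
        quant, _, _ = parse_shard(filename)
        totals[quant] += size
    return {quant: round(total, 2) for quant, total in totals.items()}


if __name__ == "__main__":
    for quant, total in footprint_gb(build_rows(SIZES_GB)).items():
        print(f"{quant}: {total:.2f} GB")
```

With these inputs the totals come to roughly 117 GB for IQ1_M and 136 GB for IQ2_XXS, so even the smallest set is far larger than any single shard.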

Model Details

Model ID: DevQuasar/nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-GGUF
Author: DevQuasar
Pipeline: text-generation
License: (not listed)
Base model: nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
Last modified: 2026-06-07T05:03:19.000Z

Model README

---
base_model:
  - nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
pipeline_tag: text-generation
---

<img src="https://raw.githubusercontent.com/csabakecskemeti/devquasar/main/dq_logo_black-transparent.png" width="200"/>

'Make knowledge free for everyone'

Quantized version of: nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
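
Each quantization in this repository is published as a numbered set of shards (`-00001-of-00009.gguf` and so on), and a split GGUF can only be loaded when every shard of the set is present. The sketch below checks a list of filenames for gaps. `missing_shards` is a hypothetical helper written for this page, and the example input mirrors the Q6_K rows in the file listing above, which stop at shard 13 of 32.

```python
import re
from collections import defaultdict

# Shard suffix as it appears in this repository's filenames.
SHARD_RE = re.compile(
    r"\.(?P<quant>[A-Z0-9_]+)-(?P<index>\d{5})-of-(?P<total>\d{5})\.gguf$"
)


def missing_shards(filenames):
    """Map each incomplete quantization to its sorted missing shard indices.

    Names that do not follow the split-shard pattern are ignored, and
    complete quantization sets are left out of the result.
    """
    seen = defaultdict(set)
    totals = {}
    for name in filenames:
        match = SHARD_RE.search(name)
        if match is None:
            continue
        quant = match["quant"]
        seen[quant].add(int(match["index"]))
        totals[quant] = int(match["total"])

    gaps = {}
    for quant, total in totals.items():
        missing = sorted(set(range(1, total + 1)) - seen[quant])
        if missing:
            gaps[quant] = missing
    return gaps


if __name__ == "__main__":
    prefix = "nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16"
    # Only shards 1..13 of the 32 Q6_K shards appear in the listing.
    listed = [f"{prefix}.Q6_K-{i:05d}-of-00032.gguf" for i in range(1, 14)]
    print(missing_shards(listed))
```

Running a check like this on a download directory before starting inference avoids a failed load after hundreds of gigabytes have already been transferred.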

Zeroshot

[Screencast from 2026-06-04 16-27-27]


Run DevQuasar/nvidia.NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-GGUF with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.


Source: Hugging Face