Question 1

What is FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF?

Accepted Answer

--- license: other language: - en library_name: gguf tags: - gguf - nemotron - nemotron-3 - nvidia - nvfp4 - mamba2 - hybrid - moe base_model: nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 pipeline_tag: text-generation inference: false quantized_by: FreedomAISVR --- # Nemotron-3-30B-Nano-Omni NVFP4 GGUF NVFP4 (E4M3) quantization of [nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16](https://huggingface.co/nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16), NVIDIA's 30B MoE (~3B active) reasoning model with Mamba2-SSM hybrid architecture. ### Why Is This File Bigger Than Expected? This model uses a **Mamba2-Transformer hybrid MoE architecture** — 23 out of 52 layers are Mamba2 state-space model (SSM) layers. The CUDA kernels that run Mamba2 SSM operations (`SSM_SCAN`, `SSM_CONV`) **require F32 inputs** and will reject quantized weight tensors entirely. When SSM weight tensors are …

Question 2

What license applies to FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF?

Accepted Answer

License: other. Verify terms on Hugging Face before commercial use.

Question 3

How do I run FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF need?

Accepted Answer

Runs locally from ~1.48 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF overview

Repository Files & Downloads

Model Details

Model README

Nemotron-3-30B-Nano-Omni NVFP4 GGUF

Why Is This File Bigger Than Expected?

The Workaround (Hybrid Quantization)

This Is a Bandaid, Not a Fix

Files

Architecture

Usage

Credits

Run FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF with guIDE

File	Type	Quantization	Size	Link
mmproj-nemotron-3-30b-f16.gguf	GGUF	F16	1.48 GB	Download
nemotron-3-30b-NVFP4.gguf	GGUF	GGUF	17.93 GB	Download

Model ID	FreedomAISVR/Nemotron-3-30B-Nano-Omni-NVFP4-GGUF
Author	FreedomAISVR
Pipeline	text-generation
License	other
Base model	nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Last modified	2026-06-18T12:42:45.000Z