Model Intelligence Sheet

noctrex/nemotron-3-nano-30b-a3b-mxfp4_moe-gguf overview

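This is an MXFP4_MOE imatrix quantization of the model NVIDIA-Nemotron-3-Nano-30B-A3B (https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth. Running it requires the latest llama.cpp release (https://github.com/ggml-org/llama.cpp/releases). For run instructions, see the Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide (https://docs.unsloth.ai/models/nemotron-3).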
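As a minimal sketch of what running the file with llama.cpp can look like, the snippet below assembles a `llama-cli` command line. It assumes the `llama-cli` binary from a recent llama.cpp release is on the PATH and that the GGUF file sits in the working directory. The prompt, context size, and GPU layer count are illustrative values, not recommendations from the model author.

```python
import shlex

def build_llama_cli_command(model_path, prompt, ctx_size=4096, gpu_layers=0):
    """Return an argv list for llama.cpp's llama-cli (values are illustrative)."""
    return [
        "llama-cli",
        "-m", model_path,          # path to the local GGUF file
        "-p", prompt,              # initial prompt text
        "-c", str(ctx_size),       # context window size in tokens (assumed value)
        "-ngl", str(gpu_layers),   # layers to offload to the GPU (0 = CPU only)
    ]

cmd = build_llama_cli_command(
    "NVIDIA-Nemotron-3-Nano-30B-A3B-MXFP4_MOE.gguf",
    "Explain what an imatrix quantization is.",
)
# Print a copy-pasteable shell command; pass `cmd` to subprocess.run to execute it.
print(shlex.join(cmd))
```

The command is built as a list rather than a single string so it can be handed to `subprocess.run(cmd)` without shell quoting issues.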

Tags: gguf, text-generation, base_model:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16, base_model:quantized:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16, endpoints_compatible, region:us, imatrix, conversational

Downloads

389

Likes

17

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

1 file detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
NVIDIA-Nemotron-3-Nano-30B-A3B-MXFP4_MOE.gguf	GGUF	MXFP4_MOE	16.75 GB	Download
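The direct download link for the file can be reconstructed from the repository id and file name. The sketch below assumes the usual Hugging Face `resolve` URL layout (`/<repo_id>/resolve/<revision>/<filename>`); the helper name `hf_resolve_url` is hypothetical. Passing the commit SHA listed on this sheet as the revision pins the download to that exact snapshot.

```python
from urllib.parse import quote

def hf_resolve_url(repo_id, filename, revision="main"):
    """Build a direct-download URL, assuming the Hugging Face 'resolve' layout."""
    return (
        "https://huggingface.co/"
        f"{quote(repo_id)}/resolve/{quote(revision, safe='')}/{quote(filename)}"
    )

REPO_ID = "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF"
FILENAME = "NVIDIA-Nemotron-3-Nano-30B-A3B-MXFP4_MOE.gguf"
SHA = "08bd4a31c26f7aa1165946e1cfc5f3e659d59819"  # HF SHA reported on this sheet

print(hf_resolve_url(REPO_ID, FILENAME))                # follows the main branch
print(hf_resolve_url(REPO_ID, FILENAME, revision=SHA))  # pinned to one commit
```

The file is 16.75 GB, so a resumable downloader is likely a better fit than a plain HTTP GET.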

Model Details

Model Slug

noctrex/nemotron-3-nano-30b-a3b-mxfp4_moe-gguf

Author

noctrex

Pipeline Task

text-generation

Library

—

Created

2025-12-15

Last Modified

2025-12-21

Gated

No

Private

No

HF SHA

08bd4a31c26f7aa1165946e1cfc5f3e659d59819

License

Unknown

Language

Unknown

Base Model

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "pipeline_tag": "text-generation",
    "base_model": [
      "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16"
    ],
    "frontmatter": {
      "pipeline_tag": "text-generation",
      "base_model": [
        "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16"
      ]
    },
    "hero_image_url": "",
    "summary": "This is a MXFP4_MOE imatrix quantization of the model NVIDIA-Nemotron-3-Nano-30B-A3B, based on the imatrix from unsloth. Get the latest llama.cpp in order to run it. Also see the instructions here: Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\npipeline_tag: text-generation\nbase_model:\n- nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16\n---\nThis is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth.\n\nGet the latest [llama.cpp](https://github.com/ggml-org/llama.cpp/releases) in order to run it.\n\nAlso see the instructions here: [Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide](https://docs.unsloth.ai/models/nemotron-3)\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "base_model:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "base_model:quantized:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 17,
  "downloads": 389,
  "gated": false,
  "private": false,
  "last_modified": "2025-12-21T17:28:45.000Z",
  "created_at": "2025-12-15T16:52:00.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
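The `tags` array above mixes plain labels with `key:value` entries such as `base_model:quantized:<repo>`. A minimal sketch of separating the two is shown below; the function name `parse_tags` is hypothetical, and splitting on the last colon is an assumption that holds for the tags listed here (none of the values contain a colon).

```python
def parse_tags(tags):
    """Split tags into plain labels and a {key: [values]} mapping."""
    plain, keyed = [], {}
    for tag in tags:
        # Split on the LAST colon so "base_model:quantized:<repo>" keeps
        # "base_model:quantized" as its key.
        key, sep, value = tag.rpartition(":")
        if sep:
            keyed.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, keyed

tags = [
    "gguf",
    "text-generation",
    "base_model:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "base_model:quantized:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational",
]

plain, keyed = parse_tags(tags)
print(plain)
print(keyed["base_model:quantized"])
```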

Source payload excerpt (from Hugging Face API)

{
  "_id": "69403cb030570c31b8c6989d",
  "id": "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF",
  "modelId": "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF",
  "sha": "08bd4a31c26f7aa1165946e1cfc5f3e659d59819",
  "createdAt": "2025-12-15T16:52:00.000Z",
  "lastModified": "2025-12-21T17:28:45.000Z",
  "author": "noctrex",
  "downloads": 389,
  "likes": 17,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 3
}
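The display fields earlier on this sheet (slug, Created, Last Modified) can be derived from the payload excerpt. The sketch below is one way to do that and is an assumption about how the sheet is produced, not a description of its actual pipeline: the slug is taken to be the lowercased `id`, and the dates are the UTC date part of the timestamps. The names `parse_hf_timestamp` and `summarize` are hypothetical.

```python
from datetime import datetime, timezone

payload = {
    "id": "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF",
    "createdAt": "2025-12-15T16:52:00.000Z",
    "lastModified": "2025-12-21T17:28:45.000Z",
}

def parse_hf_timestamp(value):
    """Parse a timestamp of the form 2025-12-15T16:52:00.000Z as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)

def summarize(payload):
    """Derive the slug and date fields shown in the Model Details section."""
    created = parse_hf_timestamp(payload["createdAt"])
    modified = parse_hf_timestamp(payload["lastModified"])
    return {
        "slug": payload["id"].lower(),  # assumed: slug is the lowercased id
        "created": created.date().isoformat(),
        "last_modified": modified.date().isoformat(),
        "days_between": (modified - created).days,
    }

print(summarize(payload))
```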