GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

noctrex/nemotron-3-nano-30b-a3b-mxfp4_moe-gguf overview

This is a MXFP4_MOE imatrix quantization of the model NVIDIA-Nemotron-3-Nano-30B-A3B, based on the imatrix from unsloth. Get the latest llama.cpp in order to run it. Also see the instructions here: Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide

gguftext-generationbase_model:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16base_model:quantized:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16endpoints_compatibleregion:usimatrixconversational
noctrex/nemotron-3-nano-30b-a3b-mxfp4_moe-gguf visual
Downloads
389
Likes
17
Pipeline
text-generation
Library
Visibility
Public
Access
Open

Repository Files & Downloads

1 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
NVIDIA-Nemotron-3-Nano-30B-A3B-MXFP4_MOE.gguf GGUF 16.75 GB Download

Model Details Live

Model Slug
noctrex/nemotron-3-nano-30b-a3b-mxfp4_moe-gguf
Author
noctrex
Pipeline Task
text-generation
Library
Created
2025-12-15
Last Modified
2025-12-21
Gated
No
Private
No
HF SHA
08bd4a31c26f7aa1165946e1cfc5f3e659d59819
License
Unknown
Language
Unknown
Base Model
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "pipeline_tag": "text-generation",
    "base_model": [
      "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16"
    ],
    "frontmatter": {
      "pipeline_tag": "text-generation",
      "base_model": [
        "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16"
      ]
    },
    "hero_image_url": "",
    "summary": "This is a MXFP4_MOE imatrix quantization of the model NVIDIA-Nemotron-3-Nano-30B-A3B, based on the imatrix from unsloth. Get the latest llama.cpp in order to run it. Also see the instructions here: Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\npipeline_tag: text-generation\nbase_model:\n- nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16\n---\nThis is a MXFP4_MOE imatrix quantization of the model [NVIDIA-Nemotron-3-Nano-30B-A3B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16), based on the imatrix from unsloth.\n\nGet the latest [llama.cpp](https://github.com/ggml-org/llama.cpp/releases) in order to run it.\n\nAlso see the instructions here: [Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide](https://docs.unsloth.ai/models/nemotron-3)\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "base_model:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "base_model:quantized:nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 17,
  "downloads": 389,
  "gated": false,
  "private": false,
  "last_modified": "2025-12-21T17:28:45.000Z",
  "created_at": "2025-12-15T16:52:00.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69403cb030570c31b8c6989d",
  "id": "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF",
  "modelId": "noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF",
  "sha": "08bd4a31c26f7aa1165946e1cfc5f3e659d59819",
  "createdAt": "2025-12-15T16:52:00.000Z",
  "lastModified": "2025-12-21T17:28:45.000Z",
  "author": "noctrex",
  "downloads": 389,
  "likes": 17,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 3
}