mmnga-o/nvidia-nemotron-nano-9b-v2-japanese-gguf Q3_K_L GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
mmnga-o/nvidia-nemotron-nano-9b-v2-japanese-gguf overview
nvidiaさんが公開しているNVIDIA-Nemotron-Nano-9B-v2-Japaneseのggufフォーマット変換版です。 imatrixのデータはTFMC/imatrix-dataset-for-japanese-llmを使用して作成しました。
Downloads
2,423
Likes
50
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open
Repository Files & Downloads
13 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-IQ3_M.gguf | GGUF | IQ3_M | 4.85 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-IQ4_NL.gguf | GGUF | IQ4_NL | 4.94 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-IQ4_XS.gguf | GGUF | IQ4_XS | 4.91 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q3_K_L.gguf | GGUF | Q3_K_L | 5.11 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q3_K_M.gguf | GGUF | Q3_K_M | 5.01 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q4_0.gguf | GGUF | — | 4.94 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q4_K_M.gguf | GGUF | Q4_K_M | 6.08 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q4_K_S.gguf | GGUF | Q4_K_S | 5.79 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q5_0.gguf | GGUF | — | 5.91 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q5_K_M.gguf | GGUF | Q5_K_M | 6.58 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q5_K_S.gguf | GGUF | Q5_K_S | 6.32 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q6_K.gguf | GGUF | Q6_K | 8.51 GB | Download |
| NVIDIA-Nemotron-Nano-9B-v2-Japanese-Q8_0.gguf | GGUF | — | 8.81 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"library_name": "transformers",
"license": "other",
"license_name": "nvidia-open-model-license",
"license_link": "https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/",
"pipeline_tag": "text-generation",
"language": [
"en",
"ja"
],
"tags": [
"nvidia"
],
"base_model": [
"nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese"
],
"datasets": [
"TFMC/imatrix-dataset-for-japanese-llm"
],
"track_downloads": true,
"frontmatter": {
"library_name": "transformers",
"license": "other",
"license_name": "nvidia-open-model-license",
"license_link": ">-",
"pipeline_tag": "text-generation",
"language": [
"en",
"ja"
],
"tags": [
"nvidia"
],
"base_model": [
"nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese"
],
"datasets": [
"TFMC/imatrix-dataset-for-japanese-llm"
],
"track_downloads": "true"
},
"hero_image_url": "",
"summary": "nvidiaさんが公開しているNVIDIA-Nemotron-Nano-9B-v2-Japaneseのggufフォーマット変換版です。 imatrixのデータはTFMC/imatrix-dataset-for-japanese-llmを使用して作成しました。",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlibrary_name: transformers\nlicense: other\nlicense_name: nvidia-open-model-license\nlicense_link: >-\n https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/\npipeline_tag: text-generation\nlanguage:\n - en\n - ja\ntags:\n- nvidia\nbase_model:\n- nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese\ndatasets:\n- TFMC/imatrix-dataset-for-japanese-llm\ntrack_downloads: true\n---\n\n# NVIDIA-Nemotron-Nano-9B-v2-Japanese-gguf\nnvidiaさんが公開している[NVIDIA-Nemotron-Nano-9B-v2-Japanese](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese)のggufフォーマット変換版です。 \n\nimatrixのデータは[TFMC/imatrix-dataset-for-japanese-llm](https://huggingface.co/datasets/TFMC/imatrix-dataset-for-japanese-llm)を使用して作成しました。 \n \n## Usage\n\n```\ngit clone https://github.com/ggml-org/llama.cpp.git\ncd llama.cpp\ncmake -B build -DGGML_CUDA=ON\ncmake --build build --config Release\nbuild/bin/llama-cli -m 'NVIDIA-Nemotron-Nano-9B-v2-Japanese-gguf' -n 128 -c 128 -p 'あなたはプロの料理人です。レシピを教えて'\n```\n",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"nvidia",
"text-generation",
"en",
"ja",
"dataset:TFMC/imatrix-dataset-for-japanese-llm",
"base_model:nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese",
"base_model:quantized:nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese",
"license:other",
"endpoints_compatible",
"region:us",
"imatrix",
"conversational"
],
"likes": 50,
"downloads": 2423,
"gated": false,
"private": false,
"last_modified": "2026-02-18T05:31:45.000Z",
"created_at": "2026-02-18T05:08:14.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "6995493efc58fb46e5563584",
"id": "mmnga-o/NVIDIA-Nemotron-Nano-9B-v2-Japanese-gguf",
"modelId": "mmnga-o/NVIDIA-Nemotron-Nano-9B-v2-Japanese-gguf",
"sha": "309f87c41751301d2dd20c4e24894ea7e09e732d",
"createdAt": "2026-02-18T05:08:14.000Z",
"lastModified": "2026-02-18T05:31:45.000Z",
"author": "mmnga-o",
"downloads": 2423,
"likes": 50,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 15
}