dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix IQ3_XS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix overview

google/gemma-2-2b-jpn-itを日本語が多く含まれる重要度行列(iMatrix)を使って量子化したgguf版です。日本語対応能力が多めに保持されている事を期待しています。 gemma-2-9b-itの4bit量子化版で比較したところ、perplexityスコアがわずかに改善する事がわかっています。 This is a quantized gguf version of google/gemma-2-2b-jpn-it using an importance matrix (iMatrix) that contains many Japanese words. I hope it retains more Japanese support. When compared with the 4-bit quantized version of gemma-2-9b-it, we found that the perplexity score improved slightly. # 使い方 How to Use. ggufフォーマットに対応したツールは様々なものがあるのでお好きなツールをお使いください。例えば、llama.cppでの使い方は以下です There are many tools that support the gguf format, so please use the one you like. For example, the usage for llama.cpp is as follows. Windows11のターミナル(CMD, Power shell)では日本語が化けてしまうのでブラウザを使ってください Please use a browser as Japanese characters will be garbled in the Windows 11 terminal (CMD, Power shell). 公式マニュアルに従ってllama.cppをビルドします Build llama.cpp according to the official manual ダウンロードしたモデルを指定して下記コマンドを実行します Execute command. ブラウザでhttp://127.0.0.1:8080を開きます Open http://127.0.0.1:8080 in your browser !image/png # どのモデルを使うべきですか？ Which model should I use? 人によって意見が異なりますが、目安としては以下です Opinions vary from person to person, but here are some guidelines:

ggufgemma2jabase_model:google/gemma-2-2b-jpn-itbase_model:quantized:google/gemma-2-2b-jpn-itlicense:gemmaendpoints_compatibleregion:usconversational

dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix visual

Downloads

470

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

22 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
gemma-2-2B-jpn-it-BF16.gguf	GGUF	BF16	4.88 GB	Download
gemma-2-2b-jpn-it-IQ3_M.gguf	GGUF	IQ3_M	1.30 GB	Download
gemma-2-2b-jpn-it-IQ3_XS.gguf	GGUF	IQ3_XS	1.22 GB	Download
gemma-2-2b-jpn-it-IQ3_XXS.gguf	GGUF	IQ3_XXS	1.10 GB	Download
gemma-2-2b-jpn-it-IQ4_XS.gguf	GGUF	IQ4_XS	1.46 GB	Download
gemma-2-2b-jpn-it-Q3_K-f16.gguf	GGUF	Q3_K	2.01 GB	Download
gemma-2-2b-jpn-it-Q3_K_L.gguf	GGUF	Q3_K_L	1.49 GB	Download
gemma-2-2b-jpn-it-Q3_K_M.gguf	GGUF	Q3_K_M	1.36 GB	Download
gemma-2-2b-jpn-it-Q3_K_S.gguf	GGUF	Q3_K_S	1.27 GB	Download
gemma-2-2b-jpn-it-Q4_K-f16.gguf	GGUF	Q4_K	2.24 GB	Download
gemma-2-2b-jpn-it-Q4_K_L.gguf	GGUF	Q4_K_L	1.72 GB	Download
gemma-2-2b-jpn-it-Q4_K_M.gguf	GGUF	Q4_K_M	1.59 GB	Download
gemma-2-2b-jpn-it-Q4_K_S.gguf	GGUF	Q4_K_S	1.53 GB	Download
gemma-2-2b-jpn-it-Q5_K-f16.gguf	GGUF	Q5_K	2.44 GB	Download
gemma-2-2b-jpn-it-Q5_K_L.gguf	GGUF	Q5_K_L	1.92 GB	Download
gemma-2-2b-jpn-it-Q5_K_M.gguf	GGUF	Q5_K_M	1.79 GB	Download
gemma-2-2b-jpn-it-Q5_K_S.gguf	GGUF	Q5_K_S	1.75 GB	Download
gemma-2-2b-jpn-it-Q6_K-f16.gguf	GGUF	Q6_K	2.65 GB	Download
gemma-2-2b-jpn-it-Q6_K.gguf	GGUF	Q6_K	2.00 GB	Download
gemma-2-2b-jpn-it-Q6_K_L.gguf	GGUF	Q6_K_L	2.14 GB	Download
gemma-2-2b-jpn-it-Q8_0-f16.gguf	GGUF	F16	3.11 GB	Download
gemma-2-2b-jpn-it-Q8_0_L.gguf	GGUF	—	2.59 GB	Download

Model Details Live

Model Slug

dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix

Author

dahara1

Pipeline Task

—

Library

—

Created

2024-10-03

Last Modified

2024-10-03

Gated

Private

HF SHA

f2a964a09385fbe6b05573bcdcf5159f03bdbd44

License

gemma

Language

Base Model

google/gemma-2-2b-jpn-it

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "gemma",
    "language": [
      "ja"
    ],
    "base_model": [
      "google/gemma-2-2b-jpn-it"
    ],
    "tags": [
      "gemma2"
    ],
    "frontmatter": {
      "license": "gemma",
      "language": [
        "ja"
      ],
      "base_model": [
        "google/gemma-2-2b-jpn-it"
      ],
      "tags": [
        "gemma2"
      ]
    },
    "hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/630469550907b9a115c91e62/PHli0VVox8bt6ziQoP02B.png",
    "summary": "google/gemma-2-2b-jpn-itを日本語が多く含まれる重要度行列(iMatrix)を使って量子化したgguf版です。 日本語対応能力が多めに保持されている事を期待しています。 gemma-2-9b-itの4bit量子化版で比較したところ、perplexityスコアがわずかに改善する事がわかっています。 This is a quantized gguf version of google/gemma-2-2b-jpn-it using an importance matrix (iMatrix) that contains many Japanese words. I hope it retains more Japanese support. When compared with the 4-bit quantized version of gemma-2-9b-it, we found that the perplexity score improved slightly. # 使い方 How to Use. ggufフォーマットに対応したツールは様々なものがあるのでお好きなツールをお使いください。例えば、llama.cppでの使い方は以下です There are many tools that support the gguf format, so please use the one you like. For example, the usage for llama.cpp is as follows. Windows11のターミナル(CMD, Power shell)では日本語が化けてしまうのでブラウザを使ってください Please use a browser as Japanese characters will be garbled in the Windows 11 terminal (CMD, Power shell). 公式マニュアルに従ってllama.cppをビルドします Build llama.cpp according to the official manual ダウンロードしたモデルを指定して下記コマンドを実行します Execute command. `` llama.cpp\\build\\bin\\Release\\llama-server -m .\\gemma-2-9b-it-Q4_K_M-fp16.gguf `` ブラウザでhttp://127.0.0.1:8080を開きます Open http://127.0.0.1:8080 in your browser !image/png # どのモデルを使うべきですか？ Which model should I use? 人によって意見が異なりますが、目安としては以下です Opinions vary from person to person, but here are some guidelines:",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: gemma\nlanguage:\n- ja\nbase_model:\n- google/gemma-2-2b-jpn-it\ntags:\n- gemma2\n---\n\n# 本モデルについて About this model.\n\n[google/gemma-2-2b-jpn-it](https://huggingface.co/google/gemma-2-2b-jpn-it)を日本語が多く含まれる重要度行列(iMatrix)を使って量子化したgguf版です。  \n日本語対応能力が多めに保持されている事を期待しています。  \n[gemma-2-9b-itの4bit量子化版で比較](https://huggingface.co/dahara1/imatrix-jpn-test)したところ、perplexityスコアがわずかに改善する事がわかっています。\n\nThis is a quantized gguf version of [google/gemma-2-2b-jpn-it](https://huggingface.co/google/gemma-2-2b-jpn-it) using an importance matrix (iMatrix) that contains many Japanese words.  \nI hope it retains more Japanese support.  \nWhen [compared with the 4-bit quantized version of gemma-2-9b-it](https://huggingface.co/google/gemma-2-2b-jpn-it), we found that the perplexity score improved slightly.  \n\n# 使い方 How to Use. \n\nggufフォーマットに対応したツールは様々なものがあるのでお好きなツールをお使いください。例えば、[llama.cpp](https://github.com/ggerganov/llama.cpp)での使い方は以下です  \nThere are many tools that support the gguf format, so please use the one you like. For example, the usage for [llama.cpp](https://github.com/ggerganov/llama.cpp) is as follows.  \n\nWindows11のターミナル(CMD, Power shell)では日本語が化けてしまうのでブラウザを使ってください  \nPlease use a browser as Japanese characters will be garbled in the Windows 11 terminal (CMD, Power shell).  \n\n公式マニュアルに従ってllama.cppをビルドします  \nBuild llama.cpp according to the official manual  \n\nダウンロードしたモデルを指定して下記コマンドを実行します  \nExecute command.  \n```\nllama.cpp\\build\\bin\\Release\\llama-server -m .\\gemma-2-9b-it-Q4_K_M-fp16.gguf\n```\nブラウザでhttp://127.0.0.1:8080を開きます  \nOpen http://127.0.0.1:8080 in your browser  \n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/630469550907b9a115c91e62/PHli0VVox8bt6ziQoP02B.png)\n\n\n# どのモデルを使うべきですか？ Which model should I use?\n\n人によって意見が異なりますが、目安としては以下です  \n- できればQ4以上\n- メモリが許す限り大きいモデル(例えば、利用可能なメモリの７割程度)\n\nOpinions vary from person to person, but here are some guidelines:  \n- Preferably Q4 or higher\n- As large a model as memory allows (for example, about 70% of available memory)\n\n\n\n\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "gemma2",
    "ja",
    "base_model:google/gemma-2-2b-jpn-it",
    "base_model:quantized:google/gemma-2-2b-jpn-it",
    "license:gemma",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 470,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-03T15:56:59.000Z",
  "created_at": "2024-10-03T14:27:12.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "66fea9c05a359c1af1db4ef8",
  "id": "dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix",
  "modelId": "dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix",
  "sha": "f2a964a09385fbe6b05573bcdcf5159f03bdbd44",
  "createdAt": "2024-10-03T14:27:12.000Z",
  "lastModified": "2024-10-03T15:56:59.000Z",
  "author": "dahara1",
  "downloads": 470,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}

dahara1/gemma-2-2b-jpn-it-gguf-japanese-imatrix overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard