GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

nocturne23/gemma-3-12b-it-q4_k_m-gguf overview

This model was converted to GGUF format from google/gemma-3-12b-it using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

transformersggufllama-cppgguf-my-repoimage-text-to-textbase_model:google/gemma-3-12b-itbase_model:quantized:google/gemma-3-12b-itlicense:gemmaendpoints_compatibleregion:usconversational
nocturne23/gemma-3-12b-it-q4_k_m-gguf visual
Downloads
2,092
Likes
2
Pipeline
image-text-to-text
Library
transformers
Visibility
Public
Access
Open

Repository Files & Downloads

1 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
gemma-3-12b-it-q4_k_m.gguf GGUF Q4_K_M 6.80 GB Download

Model Details Live

Model Slug
nocturne23/gemma-3-12b-it-q4_k_m-gguf
Author
nocturne23
Pipeline Task
image-text-to-text
Library
transformers
Created
2025-03-13
Last Modified
2025-03-13
Gated
No
Private
No
HF SHA
2dbea714de33dbdd7fa37e33856330a51c45d116
License
gemma
Language
Unknown
Base Model
google/gemma-3-12b-it

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "google/gemma-3-12b-it",
    "library_name": "transformers",
    "license": "gemma",
    "pipeline_tag": "image-text-to-text",
    "tags": [
      "llama-cpp",
      "gguf-my-repo"
    ],
    "extra_gated_heading": "Access Gemma on Hugging Face",
    "extra_gated_prompt": "To access Gemma on Hugging Face, you’re required to review and agree to Google’s usage license. To do this, please ensure you’re logged in to Hugging Face and click below. Requests are processed immediately.",
    "extra_gated_button_content": "Acknowledge license",
    "frontmatter": {
      "base_model": "google/gemma-3-12b-it",
      "library_name": "transformers",
      "license": "gemma",
      "pipeline_tag": "image-text-to-text",
      "tags": [
        "llama-cpp",
        "gguf-my-repo"
      ],
      "extra_gated_heading": "Access Gemma on Hugging Face",
      "extra_gated_prompt": "To access Gemma on Hugging Face, you’re required to review and",
      "extra_gated_button_content": "Acknowledge license"
    },
    "hero_image_url": "",
    "summary": "This model was converted to GGUF format from google/gemma-3-12b-it using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: google/gemma-3-12b-it\nlibrary_name: transformers\nlicense: gemma\npipeline_tag: image-text-to-text\ntags:\n- llama-cpp\n- gguf-my-repo\nextra_gated_heading: Access Gemma on Hugging Face\nextra_gated_prompt: To access Gemma on Hugging Face, you’re required to review and\n  agree to Google’s usage license. To do this, please ensure you’re logged in to Hugging\n  Face and click below. Requests are processed immediately.\nextra_gated_button_content: Acknowledge license\n---\n\n# nocturne23/gemma-3-12b-it-Q4_K_M-GGUF\nThis model was converted to GGUF format from [`google/gemma-3-12b-it`](https://huggingface.co/google/gemma-3-12b-it) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.\nRefer to the [original model card](https://huggingface.co/google/gemma-3-12b-it) for more details on the model.\n\n## Use with llama.cpp\nInstall llama.cpp through brew (works on Mac and Linux)\n\n```bash\nbrew install llama.cpp\n\n```\nInvoke the llama.cpp server or the CLI.\n\n### CLI:\n```bash\nllama-cli --hf-repo nocturne23/gemma-3-12b-it-Q4_K_M-GGUF --hf-file gemma-3-12b-it-q4_k_m.gguf -p \"The meaning to life and the universe is\"\n```\n\n### Server:\n```bash\nllama-server --hf-repo nocturne23/gemma-3-12b-it-Q4_K_M-GGUF --hf-file gemma-3-12b-it-q4_k_m.gguf -c 2048\n```\n\nNote: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.\n\nStep 1: Clone llama.cpp from GitHub.\n```\ngit clone https://github.com/ggerganov/llama.cpp\n```\n\nStep 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).\n```\ncd llama.cpp && LLAMA_CURL=1 make\n```\n\nStep 3: Run inference through the main binary.\n```\n./llama-cli --hf-repo nocturne23/gemma-3-12b-it-Q4_K_M-GGUF --hf-file gemma-3-12b-it-q4_k_m.gguf -p \"The meaning to life and the universe is\"\n```\nor \n```\n./llama-server --hf-repo nocturne23/gemma-3-12b-it-Q4_K_M-GGUF --hf-file gemma-3-12b-it-q4_k_m.gguf -c 2048\n```\n",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "llama-cpp",
    "gguf-my-repo",
    "image-text-to-text",
    "base_model:google/gemma-3-12b-it",
    "base_model:quantized:google/gemma-3-12b-it",
    "license:gemma",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 2,
  "downloads": 2092,
  "gated": false,
  "private": false,
  "last_modified": "2025-03-13T01:05:07.000Z",
  "created_at": "2025-03-13T01:04:19.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "67d22f13b2f89bdc4179db24",
  "id": "nocturne23/gemma-3-12b-it-Q4_K_M-GGUF",
  "modelId": "nocturne23/gemma-3-12b-it-Q4_K_M-GGUF",
  "sha": "2dbea714de33dbdd7fa37e33856330a51c45d116",
  "createdAt": "2025-03-13T01:04:19.000Z",
  "lastModified": "2025-03-13T01:05:07.000Z",
  "author": "nocturne23",
  "downloads": 2092,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "transformers",
  "siblings_count": 3
}