brittlewis12/gemma-3-1b-it-gguf IQ1_S GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

brittlewis12/gemma-3-1b-it-gguf overview

Original model: Gemma 3 1B IT Model creator: Google DeepMind Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. Gemma 3 has a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions. Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Their relatively small size makes it possible to deploy them in environments with limited resources such as laptops, desktops or your own cloud infrastructure, democratizing access to state of the art AI models and helping foster innovation for everyone. This repo contains GGUF format model files for Google DeepMind’s Gemma 3 1B IT (instruction-tuned). ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. Converted with llama.cpp build b4875 (revision 7841fc7), using autogguf-rs. ### Prompt template: Gemma Instruct ---

ggufreasoninggemmatext-generationenarxiv:1905.07830arxiv:1905.10044arxiv:1911.11641arxiv:1904.09728arxiv:1705.03551arxiv:1911.01547arxiv:1907.10641arxiv:1903.00161arxiv:2210.03057arxiv:2106.03193arxiv:1910.11856arxiv:2502.12404arxiv:2502.21228arxiv:2404.16816base_model:google/gemma-3-1b-itbase_model:quantized:google/gemma-3-1b-itlicense:gemmaendpoints_compatibleregion:usconversational

Downloads

267

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

28 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
gemma-3-1b-it.IQ1_M.gguf	GGUF	IQ1_M	613.68 MB	Download
gemma-3-1b-it.IQ1_S.gguf	GGUF	IQ1_S	609.58 MB	Download
gemma-3-1b-it.IQ2_M.gguf	GGUF	IQ2_M	638.76 MB	Download
gemma-3-1b-it.IQ2_S.gguf	GGUF	IQ2_S	633.30 MB	Download
gemma-3-1b-it.IQ2_XS.gguf	GGUF	IQ2_XS	626.87 MB	Download
gemma-3-1b-it.IQ2_XXS.gguf	GGUF	IQ2_XXS	620.50 MB	Download
gemma-3-1b-it.IQ3_M.gguf	GGUF	IQ3_M	664.77 MB	Download
gemma-3-1b-it.IQ3_S.gguf	GGUF	IQ3_S	657.86 MB	Download
gemma-3-1b-it.IQ3_XS.gguf	GGUF	IQ3_XS	657.86 MB	Download
gemma-3-1b-it.IQ3_XXS.gguf	GGUF	IQ3_XXS	648.60 MB	Download
gemma-3-1b-it.IQ4_NL.gguf	GGUF	IQ4_NL	688.42 MB	Download
gemma-3-1b-it.IQ4_XS.gguf	GGUF	IQ4_XS	681.34 MB	Download
gemma-3-1b-it.Q2_K.gguf	GGUF	Q2_K	657.86 MB	Download
gemma-3-1b-it.Q2_K_S.gguf	GGUF	Q2_K_S	640.18 MB	Download
gemma-3-1b-it.Q3_K_L.gguf	GGUF	Q3_K_L	716.76 MB	Download
gemma-3-1b-it.Q3_K_M.gguf	GGUF	Q3_K_M	688.95 MB	Download
gemma-3-1b-it.Q3_K_S.gguf	GGUF	Q3_K_S	656.94 MB	Download
gemma-3-1b-it.Q4_0.gguf	GGUF	—	687.05 MB	Download
gemma-3-1b-it.Q4_1.gguf	GGUF	—	728.64 MB	Download
gemma-3-1b-it.Q4_K_M.gguf	GGUF	Q4_K_M	768.72 MB	Download
gemma-3-1b-it.Q4_K_S.gguf	GGUF	Q4_K_S	744.81 MB	Download
gemma-3-1b-it.Q5_0.gguf	GGUF	—	770.23 MB	Download
gemma-3-1b-it.Q5_1.gguf	GGUF	—	811.82 MB	Download
gemma-3-1b-it.Q5_K_M.gguf	GGUF	Q5_K_M	811.91 MB	Download
gemma-3-1b-it.Q5_K_S.gguf	GGUF	Q5_K_S	797.65 MB	Download
gemma-3-1b-it.Q6_K.gguf	GGUF	Q6_K	964.87 MB	Download
gemma-3-1b-it.Q8_0.gguf	GGUF	—	1019.77 MB	Download
gemma-3-1b-it.f16.gguf	GGUF	F16	1.87 GB	Download

Model Details Live

Model Slug

brittlewis12/gemma-3-1b-it-gguf

Author

brittlewis12

Pipeline Task

text-generation

Library

—

Created

2025-03-12

Last Modified

2025-03-12

Gated

Private

HF SHA

cea4b931f71746a6d94df36cff58a85f232b5ef5

License

gemma

Language

Base Model

google/gemma-3-1b-it

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": "google/gemma-3-1b-it",
    "pipeline_tag": "text-generation",
    "inference": true,
    "language": [
      "en"
    ],
    "license": "gemma",
    "model_creator": "google",
    "model_name": "gemma-3-1b-it",
    "model_type": "gemma3_text",
    "quantized_by": "brittlewis12",
    "tags": [
      "reasoning",
      "gemma"
    ],
    "frontmatter": {
      "base_model": "google/gemma-3-1b-it",
      "pipeline_tag": "text-generation",
      "inference": "true",
      "language": [
        "en"
      ],
      "license": "gemma",
      "model_creator": "google",
      "model_name": "gemma-3-1b-it",
      "model_type": "gemma3_text",
      "quantized_by": "brittlewis12",
      "tags": [
        "reasoning",
        "gemma"
      ]
    },
    "hero_image_url": "https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg",
    "summary": "**Original model**: Gemma 3 1B IT **Model creator**: Google DeepMind > Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. > Gemma 3 has a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions. Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Their relatively small size makes it possible to deploy them in environments with limited resources such as laptops, desktops or your own cloud infrastructure, democratizing access to state of the art AI models and helping foster innovation for everyone. This repo contains GGUF format model files for Google DeepMind’s Gemma 3 1B IT (instruction-tuned). ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. Converted with llama.cpp build b4875 (revision 7841fc7), using autogguf-rs. ### Prompt template: Gemma Instruct `` {{system_prompt}} user {{prompt}} model `` ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: google/gemma-3-1b-it\npipeline_tag: text-generation\ninference: true\nlanguage:\n- en\nlicense: gemma\nmodel_creator: google\nmodel_name: gemma-3-1b-it\nmodel_type: gemma3_text\nquantized_by: brittlewis12\ntags:\n- reasoning\n- gemma\n---\n\n# Gemma 3 1B IT GGUF\n\n**Original model**: [Gemma 3 1B IT](https://huggingface.co/google/gemma-3-1b-it)\n\n**Model creator**: [Google DeepMind](https://huggingface.co/google)\n\n> Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. \n\n> Gemma 3 has a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions. Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Their relatively small size makes it possible to deploy them in environments with limited resources such as laptops, desktops or your own cloud infrastructure, democratizing access to state of the art AI models and helping foster innovation for everyone.\n\nThis repo contains GGUF format model files for Google DeepMind’s Gemma 3 1B IT (instruction-tuned).\n\n### What is GGUF?\n\nGGUF is a file format for representing AI models. It is the third version of the format, \nintroduced by the llama.cpp team on August 21st 2023.\n\nConverted with llama.cpp build b4875 (revision [7841fc7](https://github.com/ggml-org/llama.cpp/commits/7841fc723e059d1fd9640e5c0ef19050fcc7c698)),\nusing [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).\n\n### Prompt template: [Gemma Instruct](https://huggingface.co/google/gemma-3-1b-it/raw/main/tokenizer_config.json)\n\n```\n{{system_prompt}}\n<start_of_turn>user\n{{prompt}}<end_of_turn>\n<start_of_turn>model\n\n\n```\n\n---\n\n## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!\n\n![cnvrs.ai](https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg)\n\n[cnvrs](https://testflight.apple.com/join/sFWReS7K) is the best app for private, local AI on your device:\n- create & save **Characters** with custom system prompts & temperature settings\n- download and experiment with any **GGUF model** you can [find on HuggingFace](https://huggingface.co/models?library=gguf)!\n    * or, use an API key with the chat completions-compatible model provider of your choice -- ChatGPT, Claude, Gemini, DeepSeek, & more!\n- make it your own with custom **Theme colors**\n- powered by Metal ⚡️ & [Llama.cpp](https://github.com/ggml-org/llama.cpp), with **haptics** during response streaming!\n- **try it out** yourself today, on [Testflight](https://testflight.apple.com/join/sFWReS7K)!\n    * if you **already have the app**, download Gemma 3 1B IT now! \n    * <cnvrsai:///models/search/hf?id=brittlewis12/gemma-3-1b-it-GGUF>\n- follow [cnvrs on twitter](https://twitter.com/cnvrsai) to stay up to date\n\n### Gemma 3 1B IT in cnvrs on macOS\n\n![gemma-3 in cnvrs](https://cdn-uploads.huggingface.co/production/uploads/63b64d7a889aa6707f155cdb/dt0eLAU9pKyKnuu8oCw0D.png)\n\n---\n\n## Original Model Evaluation\n\n> These models were evaluated against a large collection of different datasets and\nmetrics to cover different aspects of text generation:\n\n#### Reasoning and factuality\n\n| Benchmark                      | Metric         | Gemma 3 PT 1B  | Gemma 3 PT 4B | Gemma 3 PT 12B | Gemma 3 PT 27B |\n| ------------------------------ |----------------|:--------------:|:-------------:|:--------------:|:--------------:|\n| [HellaSwag][hellaswag]         | 10-shot        |      62.3      |      77.2     |      84.2      |      85.6      |\n| [BoolQ][boolq]                 | 0-shot         |      63.2      |      72.3     |      78.8      |      82.4      |\n| [PIQA][piqa]                   | 0-shot         |      73.8      |      79.6     |      81.8      |      83.3      |\n| [SocialIQA][socialiqa]         | 0-shot         |      48.9      |      51.9     |      53.4      |      54.9      |\n| [TriviaQA][triviaqa]           | 5-shot         |      39.8      |      65.8     |      78.2      |      85.5      |\n| [Natural Questions][naturalq]  | 5-shot         |      9.48      |      20.0     |      31.4      |      36.1      |\n| [ARC-c][arc]                   | 25-shot        |      38.4      |      56.2     |      68.9      |      70.6      |\n| [ARC-e][arc]                   | 0-shot         |      73.0      |      82.4     |      88.3      |      89.0      |\n| [WinoGrande][winogrande]       | 5-shot         |      58.2      |      64.7     |      74.3      |      78.8      |\n| [BIG-Bench Hard][bbh]          | few-shot       |      28.4      |      50.9     |      72.6      |      77.7      |\n| [DROP][drop]                   | 1-shot         |      42.4      |      60.1     |      72.2      |      77.2      |\n\n[hellaswag]: https://arxiv.org/abs/1905.07830\n[boolq]: https://arxiv.org/abs/1905.10044\n[piqa]: https://arxiv.org/abs/1911.11641\n[socialiqa]: https://arxiv.org/abs/1904.09728\n[triviaqa]: https://arxiv.org/abs/1705.03551\n[naturalq]: https://github.com/google-research-datasets/natural-questions\n[arc]: https://arxiv.org/abs/1911.01547\n[winogrande]: https://arxiv.org/abs/1907.10641\n[bbh]: https://paperswithcode.com/dataset/bbh\n[drop]: https://arxiv.org/abs/1903.00161\n\n#### Multilingual\n\n| Benchmark                            | Gemma 3 PT 1B | Gemma 3 PT 4B | Gemma 3 PT 12B | Gemma 3 PT 27B |\n| ------------------------------------ |:-------------:|:-------------:|:--------------:|:--------------:|\n| [MGSM][mgsm]                         |      2.04     |      34.7     |      64.3     |      74.3     |\n| [Global-MMLU-Lite][global-mmlu-lite] |      24.9     |      57.0     |      69.4     |      75.7     |\n| [WMT24++][wmt24pp] (ChrF)            |      36.7     |      48.4     |      53.9     |      55.7     |\n| [FloRes][flores]                     |      29.5     |      39.2     |      46.0     |      48.8     |\n| [XQuAD][xquad] (all)                 |      43.9     |      68.0     |      74.5     |      76.8     |\n| [ECLeKTic][eclektic]                 |      4.69     |      11.0     |      17.2     |      24.4     |\n| [IndicGenBench][indicgenbench]       |      41.4     |      57.2     |      61.7     |      63.4     |\n\n[mgsm]: https://arxiv.org/abs/2210.03057\n[flores]: https://arxiv.org/abs/2106.03193\n[xquad]: https://arxiv.org/abs/1910.11856v3\n[global-mmlu-lite]: https://huggingface.co/datasets/CohereForAI/Global-MMLU-Lite\n[wmt24pp]: https://arxiv.org/abs/2502.12404v1\n[eclektic]: https://arxiv.org/abs/2502.21228\n[indicgenbench]: https://arxiv.org/abs/2404.16816\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "reasoning",
    "gemma",
    "text-generation",
    "en",
    "arxiv:1905.07830",
    "arxiv:1905.10044",
    "arxiv:1911.11641",
    "arxiv:1904.09728",
    "arxiv:1705.03551",
    "arxiv:1911.01547",
    "arxiv:1907.10641",
    "arxiv:1903.00161",
    "arxiv:2210.03057",
    "arxiv:2106.03193",
    "arxiv:1910.11856",
    "arxiv:2502.12404",
    "arxiv:2502.21228",
    "arxiv:2404.16816",
    "base_model:google/gemma-3-1b-it",
    "base_model:quantized:google/gemma-3-1b-it",
    "license:gemma",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 2,
  "downloads": 267,
  "gated": false,
  "private": false,
  "last_modified": "2025-03-12T11:56:38.000Z",
  "created_at": "2025-03-12T07:34:13.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "67d138f52e461dfe22677c47",
  "id": "brittlewis12/gemma-3-1b-it-GGUF",
  "modelId": "brittlewis12/gemma-3-1b-it-GGUF",
  "sha": "cea4b931f71746a6d94df36cff58a85f232b5ef5",
  "createdAt": "2025-03-12T07:34:13.000Z",
  "lastModified": "2025-03-12T11:56:38.000Z",
  "author": "brittlewis12",
  "downloads": 267,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 31
}

brittlewis12/gemma-3-1b-it-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard