GraySoft

abiray/gemma-4-e4b-gemini-3.1-pro-reasoning-distill-gguf is indexed on GraySoft with repository links, GGUF quantization files (Q3_K_M through Q8_0), and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. Related models in the same shard are listed below.

Model Intelligence Sheet

abiray/gemma-4-e4b-gemini-3.1-pro-reasoning-distill-gguf overview

This repository contains GGUF format model files for Ayodele01's gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill. These models were compiled and quantized via llama.cpp to enable efficient local inference on consumer hardware.

Tags: gguf, quantized, gemma, text-generation, base_model:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill, base_model:quantized:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill, endpoints_compatible, region:us, conversational
Downloads: 5,330
Likes: 4
Pipeline: text-generation
Library: gguf
Visibility: Public
Access: Open

Repository Files & Downloads

5 files detected
Direct downloads for all repository files
File | Type | Quantization | Size | Link
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q3_K_M.gguf | GGUF | Q3_K_M | 4.52 GB | Download
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q4_K_M.gguf | GGUF | Q4_K_M | 4.97 GB | Download
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q5_K_M.gguf | GGUF | Q5_K_M | 5.37 GB | Download
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q6_K.gguf | GGUF | Q6_K | 5.79 GB | Download
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q8_0.gguf | GGUF | Q8_0 | 7.48 GB | Download
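
One way to use the file sizes listed above is to pick the largest quantization that fits a memory budget. The following is a minimal sketch, not part of the repository: `pick_quant`, `download_url`, and the 1 GB `headroom_gb` default are hypothetical, and the URL assumes the files sit on the repository's `main` branch under Hugging Face's usual `resolve` path.

```python
# File sizes in GB, copied from the repository file listing above.
QUANT_SIZES_GB = {
    "Q3_K_M": 4.52,
    "Q4_K_M": 4.97,
    "Q5_K_M": 5.37,
    "Q6_K": 5.79,
    "Q8_0": 7.48,
}

REPO_ID = "Abiray/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF"
FILE_STEM = "gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill"


def pick_quant(budget_gb, headroom_gb=1.0):
    """Return the largest quantization whose file fits the budget, or None.

    headroom_gb is an assumed allowance for the KV cache and runtime
    overhead; the real figure depends on the runtime and context length.
    """
    usable = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= usable]
    return max(fitting)[1] if fitting else None


def download_url(quant):
    """Build a download URL, assuming the file lives on the 'main' branch."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{FILE_STEM}-{quant}.gguf"


if __name__ == "__main__":
    choice = pick_quant(budget_gb=6.0)
    print(choice, download_url(choice) if choice else "no file fits")
```

File size is only a lower bound on memory use, so treat the result as a starting point and adjust `headroom_gb` for the context length you plan to run.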

Model Details

Model Slug: abiray/gemma-4-e4b-gemini-3.1-pro-reasoning-distill-gguf
Author: Abiray
Pipeline Task: text-generation
Library: gguf
Created: 2026-04-04
Last Modified: 2026-04-04
Gated: No
Private: No
HF SHA: 797655194d2331cf4ab5659cd6b14e93ced07d62
License: Unknown
Language: Unknown
Base Model: Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
    "library_name": "gguf",
    "pipeline_tag": "text-generation",
    "tags": [
      "gguf",
      "quantized",
      "gemma"
    ],
    "frontmatter": {
      "base_model": "Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
      "library_name": "gguf",
      "pipeline_tag": "text-generation",
      "tags": [
        "gguf",
        "quantized",
        "gemma"
      ]
    },
    "hero_image_url": "",
    "summary": "This repository contains GGUF format model files for Ayodele01's gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill. These models were compiled and quantized via llama.cpp to enable efficient local inference on consumer hardware.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill\nlibrary_name: gguf\npipeline_tag: text-generation\ntags:\n- gguf\n- quantized\n- gemma\n---\n\n# gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF\n\nThis repository contains GGUF format model files for [Ayodele01's gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill](https://huggingface.co/Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill). \n\nThese models were compiled and quantized via `llama.cpp` to enable efficient local inference on consumer hardware.\n\n## Available Quantizations\n\n| File Name | Description |\n|---|---|\n| `gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q8_0.gguf` | 8-bit quantization. Near unquantized performance, largest file size. |\n| `gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q6_K.gguf` | 6-bit quantization. Very high quality, minimal degradation from original. |\n| `gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q5_K_M.gguf` | 5-bit quantization. Higher quality, slightly larger size and slower inference. |\n| `gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q4_K_M.gguf` | 4-bit quantization. **Recommended**. Excellent balance of speed, memory usage, and quality. |\n| `gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q3_K_M.gguf` | 3-bit quantization. Very high compression, fast inference, lower quality. |\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "quantized",
    "gemma",
    "text-generation",
    "base_model:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
    "base_model:quantized:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 4,
  "downloads": 5330,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-04T13:16:47.000Z",
  "created_at": "2026-04-04T13:10:41.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "gguf"
}
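
The top-level `tags` array mixes plain labels (`gguf`) with `key:value` entries (`region:us`, `base_model:...`). A small sketch of how a consumer might separate the two follows; `split_tags` is a hypothetical helper, and splitting on the first colon only is an assumption about how these tags are meant to be read.

```python
def split_tags(tags):
    """Separate plain tags from 'key:value' tags.

    Splits on the first colon only, so 'base_model:quantized:X'
    is stored under 'base_model' with the value 'quantized:X'.
    """
    plain, keyed = [], {}
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            keyed.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, keyed


# Tags copied from the normalized metadata above.
TAGS = [
    "gguf",
    "quantized",
    "gemma",
    "text-generation",
    "base_model:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
    "base_model:quantized:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill",
    "endpoints_compatible",
    "region:us",
    "conversational",
]

if __name__ == "__main__":
    plain, keyed = split_tags(TAGS)
    print(plain)
    print(keyed)
```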
Source payload excerpt (from Hugging Face API)
{
  "_id": "69d10dd186f23ed3d819a014",
  "id": "Abiray/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF",
  "modelId": "Abiray/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF",
  "sha": "797655194d2331cf4ab5659cd6b14e93ced07d62",
  "createdAt": "2026-04-04T13:10:41.000Z",
  "lastModified": "2026-04-04T13:16:47.000Z",
  "author": "Abiray",
  "downloads": 5330,
  "likes": 4,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "gguf",
  "siblings_count": 7
}
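
The payload's camelCase fields correspond to the values shown under Model Details. The sketch below illustrates one possible mapping. It is not GraySoft's actual normalizer: `normalize` is hypothetical, and deriving the slug by lowercasing `id` is an assumption that happens to match the slug shown on this page.

```python
from datetime import datetime


def parse_timestamp(ts):
    """Parse an API timestamp such as '2026-04-04T13:10:41.000Z'."""
    # Replace the trailing 'Z' so fromisoformat also works on older Python 3 versions.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def normalize(payload):
    """Map selected payload fields to display-style values (illustrative only)."""
    created = parse_timestamp(payload["createdAt"])
    modified = parse_timestamp(payload["lastModified"])
    return {
        "slug": payload["id"].lower(),  # assumption: slug is the lowercased id
        "author": payload["author"],
        "created": created.date().isoformat(),
        "last_modified": modified.date().isoformat(),
        "gated": "Yes" if payload["gated"] else "No",
        "private": "Yes" if payload["private"] else "No",
    }


# Fields copied from the source payload excerpt above.
PAYLOAD = {
    "id": "Abiray/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF",
    "createdAt": "2026-04-04T13:10:41.000Z",
    "lastModified": "2026-04-04T13:16:47.000Z",
    "author": "Abiray",
    "gated": False,
    "private": False,
}

if __name__ == "__main__":
    for key, value in normalize(PAYLOAD).items():
        print(f"{key}: {value}")
```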