GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE →
Model Intelligence Sheet

Dzluck/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF overview

gemma 4 E4B Gemini 3.1 Pro Reasoning Distill GGUF This repository contains GGUF format model files for Ayodele01's gemma 4 E4B Gemini 3.1 Pro Reasoning Distill…

ggufquantizedgemmatext-generationbase_model:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distillbase_model:quantized:Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distillendpoints_compatibleregion:usconversational

Runs locally from ~4.52 GB disk (8 GB VRAM class GPUs with llama.cpp / guIDE).

Downloads
0
Likes
0
Pipeline
text-generation
Author

Repository Files & Downloads

5 GGUF files detected
Direct downloads for local inference
FileTypeQuantizationSizeLink
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q3_K_M.ggufGGUFQ3_K_M4.52 GBDownload
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q4_K_M.ggufGGUFQ4_K_M4.97 GBDownload
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q5_K_M.ggufGGUFQ5_K_M5.37 GBDownload
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q6_K.ggufGGUFQ6_K5.79 GBDownload
gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q8_0.ggufGGUFQ8_07.48 GBDownload

Model Details

Model IDDzluck/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF
AuthorDzluck
Pipelinetext-generation
License
Base modelAyodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill
Last modified2026-06-20T15:25:31.000Z

Model README

---

base_model: Ayodele01/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill

library_name: gguf

pipeline_tag: text-generation

tags:

  • gguf
  • quantized
  • gemma

---

gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF

This repository contains GGUF format model files for Ayodele01's gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill.

These models were compiled and quantized via llama.cpp to enable efficient local inference on consumer hardware.

Available Quantizations

| File Name | Description |

|---|---|

| gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q8_0.gguf | 8-bit quantization. Near unquantized performance, largest file size. |

| gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q6_K.gguf | 6-bit quantization. Very high quality, minimal degradation from original. |

| gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q5_K_M.gguf | 5-bit quantization. Higher quality, slightly larger size and slower inference. |

| gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q4_K_M.gguf | 4-bit quantization. Recommended. Excellent balance of speed, memory usage, and quality. |

| gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-Q3_K_M.gguf | 3-bit quantization. Very high compression, fast inference, lower quality. |

Run Dzluck/gemma-4-E4B-Gemini-3.1-Pro-Reasoning-Distill-GGUF with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE → · Browse 524k+ models · Compare models

Source: Hugging Face · Compare models