Dzluck/gemma-4-E2B-it-uncensored-GGUF overview
gemma 4 E2B it uncensored GGUF GGUF quantizations of TrevorJS/gemma 4 E2B it uncensored https://huggingface.co/TrevorJS/gemma 4 E2B it uncensored . Files | Fil…
Runs locally from ~3.19 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).
Repository Files & Downloads
Model Details
| Model ID | Dzluck/gemma-4-E2B-it-uncensored-GGUF |
|---|---|
| Author | Dzluck |
| Pipeline | text-generation |
| License | apache-2.0 |
| Base model | TrevorJS/gemma-4-E2B-it-uncensored |
| Last modified | 2026-06-19T04:32:27.000Z |
Model README
---
base_model: TrevorJS/gemma-4-E2B-it-uncensored
base_model_relation: quantized
pipeline_tag: text-generation
language:
- en
license: apache-2.0
tags:
- abliteration
- uncensored
- gemma-4
- gguf
---
gemma-4-E2B-it-uncensored (GGUF)
GGUF quantizations of TrevorJS/gemma-4-E2B-it-uncensored.
Files
| File | Quant | Size |
|------|-------|------|
| gemma-4-E2B-it-uncensored-Q4_K_M.gguf | Q4_K_M | 3.4 GB |
| gemma-4-E2B-it-uncensored-Q8_0.gguf | Q8_0 | 5.0 GB |
Usage
# From HuggingFace (auto-downloads)
llama-server -hf TrevorJS/gemma-4-E2B-it-uncensored-GGUF -c 8192
# From local file
llama-server -m gemma-4-E2B-it-uncensored-Q4_K_M.gguf -c 8192
Then open http://localhost:8080 for the chat UI.
Details
These are GGUF quantizations of TrevorJS/gemma-4-E2B-it-uncensored, an abliterated
(uncensored) version of google/gemma-4-E2B-it.
Refusal behavior has been removed using norm-preserving biprojected abliteration.
See the bf16 model card for full method details,
before/after refusal rates, and cross-dataset validation results.
Source code: TrevorJS/gemma-4-abliteration
Run Dzluck/gemma-4-E2B-it-uncensored-GGUF with guIDE
Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.
Source: Hugging Face · Compare models