Model Intelligence Sheet
richardyoung/qwen3-14b-abliterated-gguf overview
An abliterated (uncensored) version of Qwen/Qwen3-14B in GGUF format for local inference.
Downloads
1,108
Likes
0
Pipeline
text-generation
Library
—
Visibility
Public
Access
Open
Repository Files & Downloads
6 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Qwen3-14B-abliterated-IQ3_M.gguf | GGUF | IQ3_M | 6.41 GB | Download |
| Qwen3-14B-abliterated-IQ4_XS.gguf | GGUF | IQ4_XS | 7.62 GB | Download |
| Qwen3-14B-abliterated-Q4_K_M.gguf | GGUF | Q4_K_M | 8.38 GB | Download |
| Qwen3-14B-abliterated-Q5_K_M.gguf | GGUF | Q5_K_M | 9.79 GB | Download |
| Qwen3-14B-abliterated-Q8_0.gguf | GGUF | — | 14.62 GB | Download |
| Qwen3-14B-abliterated-f16.gguf | GGUF | F16 | 27.51 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"base_model": "Qwen/Qwen3-14B",
"tags": [
"abliterated",
"uncensored",
"qwen3",
"gguf",
"llama-cpp",
"ollama",
"lm-studio",
"quantized"
],
"pipeline_tag": "text-generation",
"language": [
"en"
],
"frontmatter": {
"license": "apache-2.0",
"base_model": "Qwen/Qwen3-14B",
"tags": [
"abliterated",
"uncensored",
"qwen3",
"gguf",
"llama-cpp",
"ollama",
"lm-studio",
"quantized"
],
"pipeline_tag": "text-generation",
"language": [
"en"
]
},
"hero_image_url": "",
"summary": "An **abliterated** (uncensored) version of Qwen/Qwen3-14B in GGUF format for local inference.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nbase_model: Qwen/Qwen3-14B\ntags:\n - abliterated\n - uncensored\n - qwen3\n - gguf\n - llama-cpp\n - ollama\n - lm-studio\n - quantized\npipeline_tag: text-generation\nlanguage:\n - en\n---\n\n# Qwen3-14B Abliterated (GGUF)\n\nAn **abliterated** (uncensored) version of [Qwen/Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B) in GGUF format for local inference.\n\n## Abliteration Results\n\n| Metric | Value |\n|---|---|\n| Base Refusals | 97/100 |\n| Abliterated Refusals | 19/100 |\n| Refusal Reduction | **80%** |\n| KL Divergence | 0.98 |\n\nConservative abliteration preserves model coherence while significantly reducing refusals.\n\n## Quick Start\n\n### With Ollama\n\n```bash\nollama run hf.co/richardyoung/Qwen3-14B-abliterated-GGUF\n```\n\n### With llama.cpp\n\n```bash\nhuggingface-cli download richardyoung/Qwen3-14B-abliterated-GGUF \\\n --include \"*Q4_K_M*\" --local-dir ./models\n\n./llama-cli -m ./models/*Q4_K_M*.gguf \\\n -p \"You are a helpful assistant.\" \\\n --chat-template chatml -ngl 99\n```\n\n### With Python (llama-cpp-python)\n\n```python\nfrom llama_cpp import Llama\n\nllm = Llama.from_pretrained(\n repo_id=\"richardyoung/Qwen3-14B-abliterated-GGUF\",\n filename=\"*Q4_K_M*\",\n n_gpu_layers=-1,\n)\n\noutput = llm.create_chat_completion(\n messages=[{\"role\": \"user\", \"content\": \"Explain abliteration in simple terms.\"}]\n)\nprint(output[\"choices\"][0][\"message\"][\"content\"])\n```\n\n## Available Quantizations\n\n| Quantization | Use Case |\n|---|---|\n| Q4_K_M | **Recommended** — good balance |\n| Q5_K_M | Higher quality |\n| Q8_0 | Maximum quality |\n\n## What is Abliteration?\n\nAbliteration removes the \"refusal direction\" from a model's residual stream — a surgical modification that disables safety refusals without retraining. See [Refusal in Language Models Is Mediated by a Single Direction](https://arxiv.org/abs/2406.11717).\n\n## Intended Use\n\nResearch, creative writing, education on alignment techniques, and unrestricted local inference.\n\n## Other Models by richardyoung\n\n- **Abliterated/Uncensored models**: [Qwen2.5-7B](https://hf.co/richardyoung/Qwen2.5-7B-Instruct-abliterated-GGUF) | [Qwen3-14B](https://hf.co/richardyoung/Qwen3-14B-abliterated-GGUF) | [DeepSeek-R1-32B](https://hf.co/richardyoung/Deepseek-R1-Distill-Qwen-32b-uncensored) | [Qwen3-8B](https://hf.co/richardyoung/Qwen3-8B-Abliterated)\n- **MLX quantizations (Apple Silicon)**: [Kimi-K2 series](https://hf.co/richardyoung/Kimi-K2-Instruct-0905-MLX-4bit) | [olmOCR MLX](https://hf.co/richardyoung/olmOCR-2-7B-1025-MLX-4bit)\n- **OCR & Vision**: [olmOCR GGUF](https://hf.co/richardyoung/olmOCR-2-7B-1025-GGUF)\n- **Healthcare/Medical**: [Synthea 575K patients dataset](https://hf.co/datasets/richardyoung/synthea-575k-patients) | [CardioEmbed](https://hf.co/richardyoung/CardioEmbed)\n- **Research**: [LLM Instruction-Following Evaluation](https://hf.co/richardyoung/llm-instruction-following-paper) (arxiv:2510.18892)\n",
"related_quantizations": []
},
"tags": [
"gguf",
"abliterated",
"uncensored",
"qwen3",
"llama-cpp",
"ollama",
"lm-studio",
"quantized",
"text-generation",
"en",
"arxiv:2406.11717",
"arxiv:2510.18892",
"base_model:Qwen/Qwen3-14B",
"base_model:quantized:Qwen/Qwen3-14B",
"license:apache-2.0",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 1108,
"gated": false,
"private": false,
"last_modified": "2026-03-23T02:06:58.000Z",
"created_at": "2025-12-01T23:20:46.000Z",
"pipeline_tag": "text-generation",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "692e22ceab5a6f3ef99f2c02",
"id": "richardyoung/Qwen3-14B-abliterated-GGUF",
"modelId": "richardyoung/Qwen3-14B-abliterated-GGUF",
"sha": "f119126ae9af8e10861ae2b1d8399b065a3d8091",
"createdAt": "2025-12-01T23:20:46.000Z",
"lastModified": "2026-03-23T02:06:58.000Z",
"author": "richardyoung",
"downloads": 1108,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "",
"siblings_count": 8
}