flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf is indexed on GraySoft with repository links, GGUF quant files (Q2_K through Q6_K, plus three EvoPress mixed-precision builds), and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes.

Model Intelligence Sheet

flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf overview

This repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon Jackrong's Claude-4.6-Opus Distillation v2, the model was uncensored via the Orion-Zhen pipeline before being quantized with the EvoPress mixed-precision strategy. The README notes that this release currently has no vision capabilities, even though the pipeline tag is image-text-to-text.

Tags: gguf, mamba, uncensored, reasoning, chain-of-thought, qwen3.5, image-text-to-text, en, zh, ko, dataset:nohurry/Opus-4.6-Reasoning-3000x-filtered, dataset:Jackrong/Qwen3.5-reasoning-700x, dataset:Roman1111111/claude-opus-4.6-10000x, base_model:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2, base_model:quantized:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2, license:apache-2.0, endpoints_compatible, region:us, conversational

Downloads: 6,979
Likes: 3
Pipeline: image-text-to-text
Library: not specified
Visibility: Public
Access: Open

Repository Files & Downloads

8 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP3.5_bpw.gguf	GGUF	EvoPress, 3.5 bpw target	11.57 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP4.25_bpw.gguf	GGUF	EvoPress, 4.25 bpw target	14.37 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP5.0_bpw.gguf	GGUF	EvoPress, 5.0 bpw target	16.97 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q2_K.gguf	GGUF	Q2_K	9.98 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q3_K.gguf	GGUF	Q3_K	12.39 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q4_K.gguf	GGUF	Q4_K	15.41 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q5_K.gguf	GGUF	Q5_K	17.91 GB	Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q6_K.gguf	GGUF	Q6_K	20.57 GB	Download
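
The sizes in the table give a quick way to sanity-check the advertised bit widths and to shortlist a file for a given memory budget. The sketch below is only a rough estimate: it assumes a nominal 27e9 parameters (read off the "27B" in the model name), treats 1 GB as 1e9 bytes, and uses a hypothetical `headroom_gb` allowance for KV cache and runtime buffers. None of these figures come from the repository itself.

```python
# Sizes in GB, copied from the file table above (keys are the file-name suffixes).
FILE_SIZES_GB = {
    "EP3.5_bpw": 11.57,
    "EP4.25_bpw": 14.37,
    "EP5.0_bpw": 16.97,
    "Q2_K": 9.98,
    "Q3_K": 12.39,
    "Q4_K": 15.41,
    "Q5_K": 17.91,
    "Q6_K": 20.57,
}

NOMINAL_PARAMS = 27e9  # assumption: taken from the "27B" in the model name


def bits_per_weight(size_gb, params=NOMINAL_PARAMS):
    """Average bits per weight implied by a file size (1 GB taken as 1e9 bytes)."""
    return size_gb * 1e9 * 8 / params


def largest_fit(budget_gb, headroom_gb=2.0):
    """Return the largest file whose size plus headroom fits the budget, or None.

    headroom_gb is a hypothetical allowance for KV cache and buffers.
    """
    fitting = {k: v for k, v in FILE_SIZES_GB.items() if v + headroom_gb <= budget_gb}
    return max(fitting, key=fitting.get) if fitting else None


if __name__ == "__main__":
    for name, size in sorted(FILE_SIZES_GB.items(), key=lambda kv: kv[1]):
        print(f"{name:<11} {size:6.2f} GB  ~{bits_per_weight(size):.2f} bpw")
    print("16 GB budget:", largest_fit(16.0))
```

Under these assumptions the three EvoPress files land close to their 3.5, 4.25, and 5.0 bpw targets (about 3.43, 4.26, and 5.03). Actual memory use depends on context length and runtime, so the headroom value should be tuned rather than trusted.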

Model Details

Model Slug: flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf
Author: Flakily6416
Pipeline Task: image-text-to-text
Library: not specified
Created: 2026-04-02
Last Modified: 2026-04-06
Gated: No
Private: No
HF SHA: dc00fca088b233cca176a7601a3ba25b3c633b99
License: apache-2.0
Language: en, zh, ko
Base Model: Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2
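
The repository id, commit SHA, and file names listed on this page are enough to build a download URL pinned to one revision, so a later push to the repository cannot silently change what you fetch. The sketch below assumes the commonly used Hugging Face `resolve` URL layout and uses the case-preserved repository id from the API payload. `resolve_url` is a hypothetical helper, not part of any library.

```python
from urllib.parse import quote

# Repository id as reported by the Hugging Face API payload on this page.
REPO_ID = "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF"
# Commit SHA listed under "HF SHA" on this page.
REVISION = "dc00fca088b233cca176a7601a3ba25b3c633b99"


def resolve_url(filename, repo_id=REPO_ID, revision=REVISION):
    """Build a revision-pinned URL for one file (assumed 'resolve' URL layout)."""
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


if __name__ == "__main__":
    print(resolve_url("Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q6_K.gguf"))
```

Pinning to the SHA rather than a branch name trades convenience for reproducibility: the URL keeps pointing at the files as they were at that commit.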

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en",
      "zh",
      "ko"
    ],
    "tags": [
      "gguf",
      "mamba",
      "uncensored",
      "reasoning",
      "chain-of-thought",
      "qwen3.5"
    ],
    "base_model": "Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "pipeline_tag": "image-text-to-text",
    "datasets": [
      "nohurry/Opus-4.6-Reasoning-3000x-filtered",
      "Jackrong/Qwen3.5-reasoning-700x",
      "Roman1111111/claude-opus-4.6-10000x"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en",
        "zh",
        "ko"
      ],
      "tags": [
        "gguf",
        "mamba",
        "uncensored",
        "reasoning",
        "chain-of-thought",
        "qwen3.5"
      ],
      "base_model": "Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
      "pipeline_tag": "image-text-to-text",
      "datasets": [
        "nohurry/Opus-4.6-Reasoning-3000x-filtered",
        "Jackrong/Qwen3.5-reasoning-700x",
        "Roman1111111/claude-opus-4.6-10000x"
      ]
    },
    "hero_image_url": "",
    "summary": "This repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon **Jackrong's Claude-4.6-Opus Distillation v2**, the model was uncensored via the **Orion-Zhen pipeline** before being quantized with the **EvoPress** mixed-precision strategy.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\n- zh\n- ko\ntags:\n- gguf\n- mamba\n- uncensored\n- reasoning\n- chain-of-thought\n- qwen3.5\nbase_model: Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2\npipeline_tag: image-text-to-text\ndatasets:\n- nohurry/Opus-4.6-Reasoning-3000x-filtered\n- Jackrong/Qwen3.5-reasoning-700x\n- Roman1111111/claude-opus-4.6-10000x\n---\n# Note: This release currently does not have vision capabilities due to an oversight. \nI'll get this fixed as soon as my free lightning ai credits reset (or please get in touch if you would like to sponsor some A100 hours).\n\n# Qwen 3.5 27B Opus-Reasoning v2 (Abliterated) - Mixed Precision GGUFs\n\nThis repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon [**Jackrong's Claude-4.6-Opus Distillation v2**](https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2), the model was uncensored via the [**Orion-Zhen pipeline**](https://github.com/Orion-zhen/abliteration) before being quantized with the [**EvoPress**](https://huggingface.co/EvoPress) mixed-precision strategy.\n## 🔥 Model Lineage & Highlights\n1. **Base:** [Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2](https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2) \n   - Distilled using 14,000+ Claude 4.6 Opus-style samples to drastically improve Chain-of-Thought (CoT) efficiency. \n   - Reduces unnecessarily long internal reasoning chains while maintaining top-tier benchmark scores (e.g., 96.91% pass@1 on HumanEval).\n2. **Abliteration:** Processed via Orion-Zhen's open-source pipeline.\n\n## ⚡ EvoPress Mixed-Precision Quantization\nThis release utilizes the **EvoPress** methodology to maximize intelligence-per-gigabyte in Hybrid Mamba architectures. \nStandard quantization often degrades the sensitive State Space Model (SSM) components; these quants solve that by using tiered-precision mapping.\n\n**Key Methodology:** [EvoPress (GitHub/HF)](https://huggingface.co/EvoPress)\n\n| File Name | Target BPW | VRAM Fit | Optimization Strategy |\n| :--- | :---: | :--- | :--- |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP3.5_bpw.gguf** | 3.5 | 12GB - 16GB | Q3_K Base + F32 Mamba/Norms |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP4.25_bpw.gguf** | 4.25 | 16GB - 24GB | Q4_K Base + F32 Mamba/Norms |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP5.0_bpw.gguf** | 5.0 | 24GB+ | Q5_K Base + F32 Mamba/Norms |\n\n## 📊 Standard/Traditional Quantizations\nIncluded for comparison and compatibility with older hardware.\n\n| File | Type | Size |\n| :--- | :--- | :--- |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q2_K.gguf | Q2_K | ~10 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q3_K.gguf | Q3_K | ~13 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q4_K.gguf | Q4_K | ~16 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q5_K.gguf | Q5_K | ~19 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q6_K.gguf | Q6_K | ~23 GB |\n\n## 🛠️ Technical Details & Setup\n- **Architecture:** Hybrid Mamba-Transformer (Qwen 3.5)\n- **Quantization:** Performed using a modified `gptq-gguf-toolkit` for Mamba-aware layer mapping.\n- **Requirements:** Use a recent build of `llama.cpp` (March 2026+) for full Hybrid Mamba support.\n\n## 🤝 Credits & Acknowledgements\n- **Jackrong:** For the Claude-4.6-Opus reasoning distillation methodology and published model.\n- **Orion-Zhen:** For the abliteration and refusal-removal pipeline.\n- **Alibaba Qwen Team:** For the base Qwen 3.5 architecture.\n- **Flakily6416:** Quantization, layer-mapping, and Mixed-Precision optimization.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "mamba",
    "uncensored",
    "reasoning",
    "chain-of-thought",
    "qwen3.5",
    "image-text-to-text",
    "en",
    "zh",
    "ko",
    "dataset:nohurry/Opus-4.6-Reasoning-3000x-filtered",
    "dataset:Jackrong/Qwen3.5-reasoning-700x",
    "dataset:Roman1111111/claude-opus-4.6-10000x",
    "base_model:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "base_model:quantized:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 6979,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-06T00:34:54.000Z",
  "created_at": "2026-04-02T09:36:05.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69ce3885bf4f6d06eb918c42",
  "id": "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF",
  "modelId": "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF",
  "sha": "dc00fca088b233cca176a7601a3ba25b3c633b99",
  "createdAt": "2026-04-02T09:36:05.000Z",
  "lastModified": "2026-04-06T00:34:54.000Z",
  "author": "Flakily6416",
  "downloads": 6979,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 10
}
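
The payload fields above can be reduced to the few facts this page displays (open access, creation and update dates). The sketch below shows one way to do that with the standard library only. The `summarize` helper and its output keys are hypothetical, and the embedded JSON is a trimmed copy of the excerpt rather than a live API response.

```python
import json
from datetime import datetime

# Trimmed copy of the payload excerpt shown above.
PAYLOAD = json.loads("""
{
  "id": "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF",
  "createdAt": "2026-04-02T09:36:05.000Z",
  "lastModified": "2026-04-06T00:34:54.000Z",
  "downloads": 6979,
  "likes": 3,
  "gated": false,
  "private": false,
  "siblings_count": 10
}
""")


def parse_timestamp(value):
    """Parse an ISO 8601 timestamp with a trailing 'Z' into an aware datetime."""
    # Replacing 'Z' keeps this working on Python versions before 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize(payload):
    """Collect the display fields derived from the raw payload."""
    created = parse_timestamp(payload["createdAt"])
    modified = parse_timestamp(payload["lastModified"])
    return {
        "repo": payload["id"],
        "created": created.date().isoformat(),
        "last_modified": modified.date().isoformat(),
        # Treat any truthy 'gated' or 'private' value as restricted access.
        "open_access": not payload["gated"] and not payload["private"],
        "days_between_creation_and_update": (modified - created).days,
    }


if __name__ == "__main__":
    for key, value in summarize(PAYLOAD).items():
        print(f"{key}: {value}")
```

Note that `siblings_count` (10) is larger than the 8 GGUF files listed on this page; the difference is presumably non-GGUF repository files, which the excerpt does not enumerate.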
