GraySoft
Projects Models About FAQ Contact Download guIDE →

flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf Q6_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf overview

This repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon Jackrong's Claude-4.6-Opus Distillation v2, the model was uncensored via the Orion-Zhen pipeline before being quantized with the EvoPress mixed-precision strategy.

ggufmambauncensoredreasoningchain-of-thoughtqwen3.5image-text-to-textenzhkodataset:nohurry/Opus-4.6-Reasoning-3000x-filtereddataset:Jackrong/Qwen3.5-reasoning-700xdataset:Roman1111111/claude-opus-4.6-10000xbase_model:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2base_model:quantized:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2license:apache-2.0endpoints_compatibleregion:usconversational
flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf visual
Downloads
6,979
Likes
3
Pipeline
image-text-to-text
Library
Visibility
Public
Access
Open

Repository Files & Downloads

8 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP3.5_bpw.gguf GGUF 11.57 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP4.25_bpw.gguf GGUF 14.37 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP5.0_bpw.gguf GGUF 16.97 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q2_K.gguf GGUF Q2_K 9.98 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q3_K.gguf GGUF Q3_K 12.39 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q4_K.gguf GGUF Q4_K 15.41 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q5_K.gguf GGUF Q5_K 17.91 GB Download
Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q6_K.gguf GGUF Q6_K 20.57 GB Download

Model Details Live

Model Slug
flakily6416/qwen3.5-27b-opus-reasoning-v2-abliterated-evopress-gguf
Author
Flakily6416
Pipeline Task
image-text-to-text
Library
Created
2026-04-02
Last Modified
2026-04-06
Gated
No
Private
No
HF SHA
dc00fca088b233cca176a7601a3ba25b3c633b99
License
apache-2.0
Language
en, zh, ko
Base Model
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en",
      "zh",
      "ko"
    ],
    "tags": [
      "gguf",
      "mamba",
      "uncensored",
      "reasoning",
      "chain-of-thought",
      "qwen3.5"
    ],
    "base_model": "Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "pipeline_tag": "image-text-to-text",
    "datasets": [
      "nohurry/Opus-4.6-Reasoning-3000x-filtered",
      "Jackrong/Qwen3.5-reasoning-700x",
      "Roman1111111/claude-opus-4.6-10000x"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en",
        "zh",
        "ko"
      ],
      "tags": [
        "gguf",
        "mamba",
        "uncensored",
        "reasoning",
        "chain-of-thought",
        "qwen3.5"
      ],
      "base_model": "Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
      "pipeline_tag": "image-text-to-text",
      "datasets": [
        "nohurry/Opus-4.6-Reasoning-3000x-filtered",
        "Jackrong/Qwen3.5-reasoning-700x",
        "Roman1111111/claude-opus-4.6-10000x"
      ]
    },
    "hero_image_url": "",
    "summary": "This repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon **Jackrong's Claude-4.6-Opus Distillation v2**, the model was uncensored via the **Orion-Zhen pipeline** before being quantized with the **EvoPress** mixed-precision strategy.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\n- zh\n- ko\ntags:\n- gguf\n- mamba\n- uncensored\n- reasoning\n- chain-of-thought\n- qwen3.5\nbase_model: Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2\npipeline_tag: image-text-to-text\ndatasets:\n- nohurry/Opus-4.6-Reasoning-3000x-filtered\n- Jackrong/Qwen3.5-reasoning-700x\n- Roman1111111/claude-opus-4.6-10000x\n---\n# Note: This release currently does not have vision capabilities due to an oversight. \nI'll get this fixed as soon as my free lightning ai credits reset (or please get in touch if you would like to sponsor some A100 hours).\n\n# Qwen 3.5 27B Opus-Reasoning v2 (Abliterated) - Mixed Precision GGUFs\n\nThis repository features traditional and EvoPress GGUF quants of an abliterated reasoning model. Built upon [**Jackrong's Claude-4.6-Opus Distillation v2**](https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2), the model was uncensored via the [**Orion-Zhen pipeline**](https://github.com/Orion-zhen/abliteration) before being quantized with the [**EvoPress**](https://huggingface.co/EvoPress) mixed-precision strategy.\n## 🔥 Model Lineage & Highlights\n1. **Base:** [Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2](https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2) \n   - Distilled using 14,000+ Claude 4.6 Opus-style samples to drastically improve Chain-of-Thought (CoT) efficiency. \n   - Reduces unnecessarily long internal reasoning chains while maintaining top-tier benchmark scores (e.g., 96.91% pass@1 on HumanEval).\n2. **Abliteration:** Processed via Orion-Zhen's open-source pipeline.\n\n## ⚡ EvoPress Mixed-Precision Quantization\nThis release utilizes the **EvoPress** methodology to maximize intelligence-per-gigabyte in Hybrid Mamba architectures. Standard quantization often degrades the sensitive State Space Model (SSM) components; these quants solve that by using tiered-precision mapping.\n\n**Key Methodology:** [EvoPress (GitHub/HF)](https://huggingface.co/EvoPress)\n\n| File Name | Target BPW | VRAM Fit | Optimization Strategy |\n| :--- | :---: | :--- | :--- |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP3.5_bpw.gguf** | 3.5 | 12GB - 16GB | Q3_K Base + F32 Mamba/Norms |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP4.25_bpw.gguf** | 4.25 | 16GB - 24GB | Q4_K Base + F32 Mamba/Norms |\n| **Qwen-3.5-27B-Opus-Reasoning-Abliterated-EP5.0_bpw.gguf** | 5.0 | 24GB+ | Q5_K Base + F32 Mamba/Norms |\n\n## 📊 Standard/Traditional Quantizations\nIncluded for comparison and compatibility with older hardware.\n\n| File | Type | Size |\n| :--- | :--- | :--- |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q2_K.gguf | Q2_K | ~10 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q3_K.gguf | Q3_K | ~13 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q4_K.gguf | Q4_K | ~16 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q5_K.gguf | Q5_K | ~19 GB |\n| Qwen-3.5-27B-Opus-Reasoning-Abliterated-Q6_K.gguf | Q6_K | ~23 GB |\n\n## 🛠️ Technical Details & Setup\n- **Architecture:** Hybrid Mamba-Transformer (Qwen 3.5)\n- **Quantization:** Performed using a modified `gptq-gguf-toolkit` for Mamba-aware layer mapping.\n- **Requirements:** Use a recent build of `llama.cpp` (March 2026+) for full Hybrid Mamba support.\n\n## 🤝 Credits & Acknowledgements\n- **Jackrong:** For the Claude-4.6-Opus reasoning distillation methodology and published model.\n- **Orion-Zhen:** For the abliteration and refusal-removal pipeline.\n- **Alibaba Qwen Team:** For the base Qwen 3.5 architecture.\n- **Flakily6416:** Quantization, layer-mapping, and Mixed-Precision optimization.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "mamba",
    "uncensored",
    "reasoning",
    "chain-of-thought",
    "qwen3.5",
    "image-text-to-text",
    "en",
    "zh",
    "ko",
    "dataset:nohurry/Opus-4.6-Reasoning-3000x-filtered",
    "dataset:Jackrong/Qwen3.5-reasoning-700x",
    "dataset:Roman1111111/claude-opus-4.6-10000x",
    "base_model:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "base_model:quantized:Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 6979,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-06T00:34:54.000Z",
  "created_at": "2026-04-02T09:36:05.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69ce3885bf4f6d06eb918c42",
  "id": "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF",
  "modelId": "Flakily6416/Qwen3.5-27B-Opus-Reasoning-v2-Abliterated-EvoPress-GGUF",
  "sha": "dc00fca088b233cca176a7601a3ba25b3c633b99",
  "createdAt": "2026-04-02T09:36:05.000Z",
  "lastModified": "2026-04-06T00:34:54.000Z",
  "author": "Flakily6416",
  "downloads": 6979,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 10
}