Model Intelligence Sheet

nuofang/huihui-qwen3.5-9b-claude-4.6-opus-abliterated-gguf overview

Auto-Quantized GGUF Model This repository contains automated GGUF quantization files for huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated. Q5KM: Moderately compressed and effectively leverages the imatrix, making it nearly indistinguishable from the original precision. (recommended) IQ4XS: Compressed to the minimum footprint with only a slight degradation in quality. The calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense. Q5KM: 适度压缩且发挥了imatrix的作用，难以察觉到与原精度的区别。（推荐） IQ4XS: 在只有轻微质量下降的情况下，压缩到最小占用。 imatrix 的校准数据以中文的小说、角色扮演为目标，同时保留逻辑和常识。

ggufllama.cppquantizedimatrixbase_model:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliteratedbase_model:quantized:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliteratedendpoints_compatibleregion:usconversational

nuofang/huihui-qwen3.5-9b-claude-4.6-opus-abliterated-gguf visual

Downloads

1,398

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

4 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-IQ4_XS.gguf	GGUF	IQ4_XS	4.84 GB	Download
Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-Q5_K_M.gguf	GGUF	Q5_K_M	6.02 GB	Download
imatrix.gguf	GGUF	—	4.91 MB	Download
mmproj-Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-f16.gguf	GGUF	F16	879.01 MB	Download

Model Details Live

Model Slug

nuofang/huihui-qwen3.5-9b-claude-4.6-opus-abliterated-gguf

Author

nuofang

Pipeline Task

—

Library

—

Created

2026-03-22

Last Modified

2026-03-22

Gated

Private

HF SHA

8cdba648117b69fe3133c3dedf169c3feb19b7e3

License

Unknown

Language

Unknown

Base Model

huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": "huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "tags": [
      "llama.cpp",
      "quantized",
      "imatrix"
    ],
    "frontmatter": {
      "base_model": "huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
      "tags": [
        "llama.cpp",
        "quantized",
        "imatrix"
      ]
    },
    "hero_image_url": "",
    "summary": "# Auto-Quantized GGUF Model This repository contains automated GGUF quantization files for huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated. Q5_K_M: Moderately compressed and effectively leverages the imatrix, making it nearly indistinguishable from the original precision. (recommended) IQ4_XS: Compressed to the minimum footprint with only a slight degradation in quality. The calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense. Q5_K_M: 适度压缩且发挥了imatrix的作用，难以察觉到与原精度的区别。（推荐） IQ4_XS: 在只有轻微质量下降的情况下，压缩到最小占用。 imatrix 的校准数据以中文的小说、角色扮演为目标，同时保留逻辑和常识。",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated\ntags:\n- llama.cpp\n- quantized\n- imatrix\n---\n# Auto-Quantized GGUF Model\nThis repository contains automated GGUF quantization files for [huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated).  \n\nQ5_K_M: Moderately compressed and effectively leverages the imatrix, making it nearly indistinguishable from the original precision. (recommended)  \nIQ4_XS: Compressed to the minimum footprint with only a slight degradation in quality.   \nThe calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense.  \n\nQ5_K_M: 适度压缩且发挥了imatrix的作用，难以察觉到与原精度的区别。（推荐）  \nIQ4_XS: 在只有轻微质量下降的情况下，压缩到最小占用。  \nimatrix 的校准数据以中文的小说、角色扮演为目标，同时保留逻辑和常识。\n\n\n## 📊 Perplexity Evaluation\n*(Tested against the provided calibration dataset)*\n\n- **Base (F16/BF16)**: PPL = 16.4336 +/- 0.14178\n- **Q5_K_M**: PPL = 14.6771 +/- 0.12020\n- **IQ4_XS**: PPL = 14.9560 +/- 0.12308\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "llama.cpp",
    "quantized",
    "imatrix",
    "base_model:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "base_model:quantized:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 1398,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-22T05:30:56.000Z",
  "created_at": "2026-03-22T02:39:55.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69bf567b8ef670b5bf4df555",
  "id": "nuofang/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-GGUF",
  "modelId": "nuofang/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-GGUF",
  "sha": "8cdba648117b69fe3133c3dedf169c3feb19b7e3",
  "createdAt": "2026-03-22T02:39:55.000Z",
  "lastModified": "2026-03-22T05:30:56.000Z",
  "author": "nuofang",
  "downloads": 1398,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 6
}