GraySoft
Model Intelligence Sheet

nuofang/huihui-qwen3.5-9b-claude-4.6-opus-abliterated-gguf overview

Auto-Quantized GGUF Model

This repository contains automated GGUF quantization files for huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated.

Q5_K_M (recommended): Moderately compressed and makes effective use of the imatrix; the author describes it as nearly indistinguishable from the original precision.
IQ4_XS: Compressed to the minimum footprint with only a slight degradation in quality.

The calibration data for the imatrix targets Chinese novels and role-playing (RP), while preserving logic and common sense.

Tags: gguf, llama.cpp, quantized, imatrix, base_model:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated, base_model:quantized:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated, endpoints_compatible, region:us, conversational
Downloads: 1,398
Likes: 0
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

4 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size |
| --- | --- | --- | --- |
| Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-IQ4_XS.gguf | GGUF | IQ4_XS | 4.84 GB |
| Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-Q5_K_M.gguf | GGUF | Q5_K_M | 6.02 GB |
| imatrix.gguf | GGUF | | 4.91 MB |
| mmproj-Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-f16.gguf | GGUF | F16 | 879.01 MB |
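The file listing above can be turned into direct download URLs. The following is a minimal sketch, not the site's own tooling: it assumes the standard Hugging Face `resolve` URL layout and the default `main` revision, and the helper names `resolve_url` and `download` are hypothetical. The repository ID is taken from the source payload on this page. The optional `download` helper assumes the `huggingface_hub` package is installed and imports it lazily.

```python
from urllib.parse import quote

# Repository ID as reported by the Hugging Face API payload on this page.
REPO_ID = "nuofang/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-GGUF"

# File names and sizes exactly as listed in the repository file table.
FILES = {
    "Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-IQ4_XS.gguf": "4.84 GB",
    "Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-Q5_K_M.gguf": "6.02 GB",
    "imatrix.gguf": "4.91 MB",
    "mmproj-Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-f16.gguf": "879.01 MB",
}


def resolve_url(filename, repo_id=REPO_ID, revision="main"):
    """Build a direct download URL (assumes the default 'main' revision)."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def download(filename, repo_id=REPO_ID):
    """Fetch one file into the local Hugging Face cache and return its path.

    Assumption: the huggingface_hub package is installed.
    """
    from huggingface_hub import hf_hub_download  # imported lazily on purpose

    return hf_hub_download(repo_id=repo_id, filename=filename)


if __name__ == "__main__":
    for name, size in FILES.items():
        print(f"{size:>10}  {resolve_url(name)}")
```

Passing the commit hash shown under HF SHA as `revision` instead of `main` should pin the download to the snapshot this sheet describes.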

Model Details

Model Slug: nuofang/huihui-qwen3.5-9b-claude-4.6-opus-abliterated-gguf
Author: nuofang
Pipeline Task: (not set)
Library: (not set)
Created: 2026-03-22
Last Modified: 2026-03-22
Gated: No
Private: No
HF SHA: 8cdba648117b69fe3133c3dedf169c3feb19b7e3
License: Unknown
Language: Unknown
Base Model: huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "tags": [
      "llama.cpp",
      "quantized",
      "imatrix"
    ],
    "frontmatter": {
      "base_model": "huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
      "tags": [
        "llama.cpp",
        "quantized",
        "imatrix"
      ]
    },
    "hero_image_url": "",
    "summary": "# Auto-Quantized GGUF Model This repository contains automated GGUF quantization files for huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated. Q5_K_M: Moderately compressed and effectively leverages the imatrix, making it nearly indistinguishable from the original precision. (recommended) IQ4_XS: Compressed to the minimum footprint with only a slight degradation in quality. The calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense. Q5_K_M: 适度压缩且发挥了imatrix的作用,难以察觉到与原精度的区别。(推荐) IQ4_XS: 在只有轻微质量下降的情况下,压缩到最小占用。 imatrix 的校准数据以中文的小说、角色扮演为目标,同时保留逻辑和常识。",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated\ntags:\n- llama.cpp\n- quantized\n- imatrix\n---\n# Auto-Quantized GGUF Model\nThis repository contains automated GGUF quantization files for [huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated).  \n\nQ5_K_M: Moderately compressed and effectively leverages the imatrix, making it nearly indistinguishable from the original precision. (recommended)  \nIQ4_XS: Compressed to the minimum footprint with only a slight degradation in quality.   \nThe calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense.  \n\nQ5_K_M: 适度压缩且发挥了imatrix的作用,难以察觉到与原精度的区别。(推荐)  \nIQ4_XS: 在只有轻微质量下降的情况下,压缩到最小占用。  \nimatrix 的校准数据以中文的小说、角色扮演为目标,同时保留逻辑和常识。\n\n\n## 📊 Perplexity Evaluation\n*(Tested against the provided calibration dataset)*\n\n- **Base (F16/BF16)**: PPL = 16.4336 +/- 0.14178\n- **Q5_K_M**: PPL = 14.6771 +/- 0.12020\n- **IQ4_XS**: PPL = 14.9560 +/- 0.12308\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "llama.cpp",
    "quantized",
    "imatrix",
    "base_model:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "base_model:quantized:huihui-ai/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 1398,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-22T05:30:56.000Z",
  "created_at": "2026-03-22T02:39:55.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
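The `readme_markdown` field above carries a perplexity section that the rendered overview omits. As a rough sketch, the figures can be parsed and compared against the base model as follows. The regular expression and the helper names `parse_ppl` and `relative_change` are illustrative assumptions, not part of any published tooling. The README states these values were measured against the calibration dataset itself, so they may not carry over to other text.

```python
import re

# Perplexity lines copied verbatim from the readme_markdown field.
README_PPL = """\
- **Base (F16/BF16)**: PPL = 16.4336 +/- 0.14178
- **Q5_K_M**: PPL = 14.6771 +/- 0.12020
- **IQ4_XS**: PPL = 14.9560 +/- 0.12308
"""

# Matches "**<name>**: PPL = <value> +/- <error>".
PPL_LINE = re.compile(
    r"\*\*(?P<name>[^*]+)\*\*: PPL = (?P<ppl>[\d.]+) \+/- (?P<err>[\d.]+)"
)


def parse_ppl(text):
    """Return {variant name: (perplexity, reported error)}."""
    return {
        m["name"]: (float(m["ppl"]), float(m["err"]))
        for m in PPL_LINE.finditer(text)
    }


def relative_change(ppl, base):
    """Percentage change of a perplexity value relative to the base value."""
    return (ppl - base) / base * 100.0


results = parse_ppl(README_PPL)
base_ppl, _ = results["Base (F16/BF16)"]

for name, (ppl, err) in results.items():
    change = relative_change(ppl, base_ppl)
    print(f"{name}: {ppl:.4f} +/- {err:.5f} ({change:+.2f}% vs base)")
```

Both quantized variants report a lower perplexity than the base here; a plausible reading, given that the imatrix was calibrated on the same data used for the test, is that the comparison favors the quantized files on this particular dataset.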
Source payload excerpt (from Hugging Face API)
{
  "_id": "69bf567b8ef670b5bf4df555",
  "id": "nuofang/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-GGUF",
  "modelId": "nuofang/Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated-GGUF",
  "sha": "8cdba648117b69fe3133c3dedf169c3feb19b7e3",
  "createdAt": "2026-03-22T02:39:55.000Z",
  "lastModified": "2026-03-22T05:30:56.000Z",
  "author": "nuofang",
  "downloads": 1398,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 6
}
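The timestamps in the payload can be parsed to see how long the repository was being updated after creation. This is a small sketch using only the Python standard library; the helper name `parse_hf_timestamp` is hypothetical, and the format string assumes the millisecond UTC layout shown in the excerpt above.

```python
from datetime import datetime, timezone

# Fields copied from the source payload excerpt.
payload = {
    "createdAt": "2026-03-22T02:39:55.000Z",
    "lastModified": "2026-03-22T05:30:56.000Z",
    "downloads": 1398,
    "likes": 0,
}


def parse_hf_timestamp(value):
    """Parse a 'YYYY-MM-DDTHH:MM:SS.mmmZ' string into an aware UTC datetime."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


created = parse_hf_timestamp(payload["createdAt"])
modified = parse_hf_timestamp(payload["lastModified"])
elapsed = modified - created

print(f"Created:       {created.isoformat()}")
print(f"Last modified: {modified.isoformat()}")
print(f"Elapsed:       {elapsed}")
```

`strptime` is used instead of `datetime.fromisoformat` because the trailing `Z` is only accepted by `fromisoformat` from Python 3.11 onward.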