Model Intelligence Sheet

noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf overview

These are quantizations of the model Huihui-Qwen3.5-35B-A3B-abliterated Read the guide from unsloth in order to set up the model's recommended settings: Qwen3.5 - How to Run Locally Guide The mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest. So I created a new variant, where the other tensors are BF16 instead of Q8. On some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized. As of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth. If you don't want to download the model again, you can just update the chat template.

ggufimage-text-to-textbase_model:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliteratedbase_model:quantized:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliteratedendpoints_compatibleregion:usconversational

noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf visual

Downloads

2,895

Likes

Pipeline

image-text-to-text

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

4 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE.gguf	GGUF	—	20.55 GB	Download
mmproj-BF16.gguf	GGUF	BF16	861.00 MB	Download
mmproj-F16.gguf	GGUF	F16	857.62 MB	Download
mmproj-F32.gguf	GGUF	F32	1.66 GB	Download

Model Details Live

Model Slug

noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf

Author

noctrex

Pipeline Task

image-text-to-text

Library

—

Created

2026-02-27

Last Modified

2026-03-03

Gated

Private

HF SHA

035c3b2524a020dd8984c9384a8350d3dd4193e9

License

Unknown

Language

Unknown

Base Model

huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "pipeline_tag": "image-text-to-text",
    "base_model": [
      "huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated"
    ],
    "frontmatter": {
      "pipeline_tag": "image-text-to-text",
      "base_model": [
        "huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated"
      ]
    },
    "hero_image_url": "",
    "summary": "These are quantizations of the model Huihui-Qwen3.5-35B-A3B-abliterated Read the guide from unsloth in order to set up the model's recommended settings: Qwen3.5 - How to Run Locally Guide The mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest. So I created a new variant, where the other tensors are BF16 instead of Q8. On some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized. As of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth. If you don't want to download the model again, you can just update the chat template.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\npipeline_tag: image-text-to-text\nbase_model:\n- huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated\n---\nThese are quantizations of the model [Huihui-Qwen3.5-35B-A3B-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated)\n\n- Download the latest [llama.cpp](https://github.com/ggml-org/llama.cpp) to use these quantizations.  \n- For the `mmproj` file, the F32 version is recommended for best results.  \n- Order of quality: F32 > BF16 > F16  \n\nRead the guide from unsloth in order to set up the model's recommended settings:  \n[Qwen3.5 - How to Run Locally Guide](https://unsloth.ai/docs/models/qwen3.5)\n\nThe mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest.  \nSo I created a new variant, where the other tensors are BF16 instead of Q8.  \nOn some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized.\n\nAs of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth.  \nIf you don't want to download the model again, you can just update the chat template.\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "image-text-to-text",
    "base_model:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated",
    "base_model:quantized:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 11,
  "downloads": 2895,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-03T10:01:27.000Z",
  "created_at": "2026-02-27T13:42:16.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69a19f38111a0fa5ed84ede7",
  "id": "noctrex/Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE-GGUF",
  "modelId": "noctrex/Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE-GGUF",
  "sha": "035c3b2524a020dd8984c9384a8350d3dd4193e9",
  "createdAt": "2026-02-27T13:42:16.000Z",
  "lastModified": "2026-03-03T10:01:27.000Z",
  "author": "noctrex",
  "downloads": 2895,
  "likes": 11,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 6
}