GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf overview

These are quantizations of the model Huihui-Qwen3.5-35B-A3B-abliterated Read the guide from unsloth in order to set up the model's recommended settings: Qwen3.5 - How to Run Locally Guide The mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest. So I created a new variant, where the other tensors are BF16 instead of Q8. On some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized. As of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth. If you don't want to download the model again, you can just update the chat template.

ggufimage-text-to-textbase_model:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliteratedbase_model:quantized:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliteratedendpoints_compatibleregion:usconversational
noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf visual
Downloads
2,895
Likes
11
Pipeline
image-text-to-text
Library
Visibility
Public
Access
Open

Repository Files & Downloads

4 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE.gguf GGUF 20.55 GB Download
mmproj-BF16.gguf GGUF BF16 861.00 MB Download
mmproj-F16.gguf GGUF F16 857.62 MB Download
mmproj-F32.gguf GGUF F32 1.66 GB Download

Model Details Live

Model Slug
noctrex/huihui-qwen3.5-35b-a3b-abliterated-mxfp4_moe-gguf
Author
noctrex
Pipeline Task
image-text-to-text
Library
Created
2026-02-27
Last Modified
2026-03-03
Gated
No
Private
No
HF SHA
035c3b2524a020dd8984c9384a8350d3dd4193e9
License
Unknown
Language
Unknown
Base Model
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "pipeline_tag": "image-text-to-text",
    "base_model": [
      "huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated"
    ],
    "frontmatter": {
      "pipeline_tag": "image-text-to-text",
      "base_model": [
        "huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated"
      ]
    },
    "hero_image_url": "",
    "summary": "These are quantizations of the model Huihui-Qwen3.5-35B-A3B-abliterated Read the guide from unsloth in order to set up the model's recommended settings: Qwen3.5 - How to Run Locally Guide The mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest. So I created a new variant, where the other tensors are BF16 instead of Q8. On some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized. As of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth. If you don't want to download the model again, you can just update the chat template.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\npipeline_tag: image-text-to-text\nbase_model:\n- huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated\n---\nThese are quantizations of the model [Huihui-Qwen3.5-35B-A3B-abliterated](https://huggingface.co/huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated)\n\n- Download the latest [llama.cpp](https://github.com/ggml-org/llama.cpp) to use these quantizations.  \n- For the `mmproj` file, the F32 version is recommended for best results.  \n- Order of quality: F32 > BF16 > F16  \n\nRead the guide from unsloth in order to set up the model's recommended settings:  \n[Qwen3.5 - How to Run Locally Guide](https://unsloth.ai/docs/models/qwen3.5)\n\nThe mainline standard is to use MXFP4 for the MoE tensors, and Q8 for the rest.  \nSo I created a new variant, where the other tensors are BF16 instead of Q8.  \nOn some architectures BF16 will be slower, but its the highest quality, essentialy its the original tensors from the model copied over unquantized.\n\nAs of 2026-03-03 the chat template has been updated, and is using the fixed version from unsloth.  \nIf you don't want to download the model again, you can just update the chat template.\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "image-text-to-text",
    "base_model:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated",
    "base_model:quantized:huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 11,
  "downloads": 2895,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-03T10:01:27.000Z",
  "created_at": "2026-02-27T13:42:16.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69a19f38111a0fa5ed84ede7",
  "id": "noctrex/Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE-GGUF",
  "modelId": "noctrex/Huihui-Qwen3.5-35B-A3B-abliterated-MXFP4_MOE-GGUF",
  "sha": "035c3b2524a020dd8984c9384a8350d3dd4193e9",
  "createdAt": "2026-02-27T13:42:16.000Z",
  "lastModified": "2026-03-03T10:01:27.000Z",
  "author": "noctrex",
  "downloads": 2895,
  "likes": 11,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 6
}