GraySoft indexes luffythefox/qwen3.5-35b-a3b-uncensored-wasserstein-gguf (Q4_K_L GGUF, free download) with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. Related models in the same shard are listed below.

Model Intelligence Sheet

luffythefox/qwen3.5-35b-a3b-uncensored-wasserstein-gguf overview

Base model: HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive (0/465 refusals). Tensor drift was repaired by the repository author using the Sig-ScaleSync-Wasserstein method. The quantization script is available at https://pastebin.com/hXhcMJn9 for anyone who wants to make their own quants.

Tags: gguf, uncensored, qwen3.5, moe, vision, multimodal, image-text-to-text, en, zh, multilingual, base_model:Qwen/Qwen3.5-35B-A3B, base_model:quantized:Qwen/Qwen3.5-35B-A3B, license:apache-2.0, endpoints_compatible, region:us, conversational

Downloads

2,188

Likes

1

Pipeline

image-text-to-text

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

3 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen3.5-35B-A3B-Uncensored-Wasserstein-BF16.gguf	GGUF	BF16	64.61 GB	Download
Qwen3.5-35B-A3B-Uncensored-Wasserstein-Q4_K_L.gguf	GGUF	Q4_K_L	20.11 GB	Download
mmproj-FernflowerAI-f16.gguf	GGUF	F16	857.62 MB	Download
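
The files in the table above can also be fetched by script. The following is a minimal sketch, assuming the files sit on the `main` branch of the Hugging Face repository named in the source payload and that the usual Hugging Face `resolve` URL layout applies. `resolve_url` is a hypothetical helper written for this example, not part of any library.

```python
from urllib.parse import quote

# Repository id as reported in the Hugging Face payload on this page.
REPO_ID = "LuffyTheFox/Qwen3.5-35B-A3B-Uncensored-Wasserstein-GGUF"

# File names copied from the download table.
FILES = [
    "Qwen3.5-35B-A3B-Uncensored-Wasserstein-BF16.gguf",
    "Qwen3.5-35B-A3B-Uncensored-Wasserstein-Q4_K_L.gguf",
    "mmproj-FernflowerAI-f16.gguf",
]


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL (hypothetical helper).

    Assumes the standard https://huggingface.co/<repo>/resolve/<rev>/<file> layout
    and that the files are published on the given revision.
    """
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision)}/{quote(filename)}"
    )


if __name__ == "__main__":
    for name in FILES:
        print(resolve_url(REPO_ID, name))
```

For repeated downloads, `hf_hub_download(repo_id=..., filename=...)` from the `huggingface_hub` package is likely the better choice, since it caches files locally; the URL form above is mainly useful with tools such as `curl` or `wget`.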

Model Details Live

Model Slug

luffythefox/qwen3.5-35b-a3b-uncensored-wasserstein-gguf

Author

LuffyTheFox

Pipeline Task

image-text-to-text

Library

—

Created

2026-04-16

Last Modified

2026-04-16

Gated

No

Private

No

HF SHA

8a5cb729b040da007534237ac263b22b19e28b15

License

apache-2.0

Language

en, zh, multilingual

Base Model

Qwen/Qwen3.5-35B-A3B

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "tags": [
      "uncensored",
      "qwen3.5",
      "moe",
      "gguf",
      "vision",
      "multimodal"
    ],
    "language": [
      "en",
      "zh",
      "multilingual"
    ],
    "pipeline_tag": "image-text-to-text",
    "base_model": "Qwen/Qwen3.5-35B-A3B",
    "frontmatter": {
      "license": "apache-2.0",
      "tags": [
        "uncensored",
        "qwen3.5",
        "moe",
        "gguf",
        "vision",
        "multimodal"
      ],
      "language": [
        "en",
        "zh",
        "multilingual"
      ],
      "pipeline_tag": "image-text-to-text",
      "base_model": "Qwen/Qwen3.5-35B-A3B"
    },
    "hero_image_url": "",
    "summary": "Base model: HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive - **0/465 refusals.** **Tensor drift repair by me. Method: Sig-ScaleSync-Wasserstein** **Quantization script available here: https://pastebin.com/hXhcMJn9** Feel free to do your own quants if you want. ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\ntags:\n  - uncensored\n  - qwen3.5\n  - moe\n  - gguf\n  - vision\n  - multimodal\nlanguage:\n  - en\n  - zh\n  - multilingual\npipeline_tag: image-text-to-text\nbase_model: Qwen/Qwen3.5-35B-A3B\n---\n\n# Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive (Repaired) -> Wasserstein\n\nBase model: [HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive](https://huggingface.co/HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive) - **0/465 refusals.**\n\n**Tensor drift repair by me. Method: Sig-ScaleSync-[Wasserstein](https://en.wikipedia.org/wiki/Wasserstein_metric)** \n\n**Quantization script available here: https://pastebin.com/hXhcMJn9**\n\nFeel free to do your own quants if you want.\n\n---\n\n## Tensor Repair Summary\n\n**Model:** Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive-BF16.gguf (64.61 GB)\n\n| Metric | Value |\n|--------|-------|\n| Weight tensors analyzed | 500 |\n| Healthy (all criteria) | 497 |\n| Repaired (C2 - scale misalignment) | 3 |\n| Skipped (norms, embeddings, etc.) | 233 |\n\n**No issues found:** C1 (saturation), C3 (W1 divergence), C4 (ReLU asymmetry)\n\n---\n\n### Repair Statistics\n\n| Metric | Before | After | Improvement |\n|--------|--------|-------|-------------|\n| S (saturation error) | 0.0023 | 0.0009 | 63.0% |\n| W1 (Wasserstein-1) | 0.0034 | 0.0008 | 76.6% |\n\n**Scale repair coefficients (α):** min=0.580, mean=0.608, max=0.657\n\n---\n\n### Repaired Tensors (C2 — scale misalignment)\n\nAll three are `ssm_conv1d.weight` layers - the recurrent state transition layers responsible for long-context memory.\n\n| Block | α | D (log-ratio) | W1 before | W1 after |\n|-------|---|---------------|-----------|----------|\n| blk.36 | 0.5852 | 0.545 | 0.0037 | 0.0009 |\n| blk.37 | 0.5800 | 0.707 | 0.0039 | 0.0009 |\n| blk.38 | 0.6573 | 0.628 | 0.0026 | 0.0006 |\n\n**Interpretation:** All three layers were too loud (σ_w > σ_med by 50-100%). Scale correction restored them to peer median. 
W1 dropped by ~80%, confirming distribution shape normalized.\n\n---\n\n### Peer-Group Statistics (shape: 4×8192)\n\n| Metric | Value |\n|--------|-------|\n| Group size | 30 tensors |\n| Median σ_rob | 0.00652 |\n| Damaged tensors | 3 (blk.36, 37, 38) |\n\n**Scale misalignment distribution:** 3 tensors in [0.50, 1.00) log-ratio range.\n\n---\n\n### Verdict\n\nModel is clinically healthy. 497 out of 500 weight tensors passed all four criteria. Three SSM layers were repaired successfully. No saturation, no KL/W1 drift, no ReLU asymmetry. Ready for quantization.\n\n---\n\n## Usage\n\n**Ready to use.** Recommended quantization: **Q4_K_L, or higher** (Q4_K_M, Q5_K_M, Q6_K, Q8_0).  \n⚠️ Lower formats (Q3_K, Q2_K) break the model due to MoE + DeltaNet sensitivity.\n\n**Links:**\n- [Original uncensored model](https://huggingface.co/HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive)\n- [Quantization Script with Unsloth profiles support](https://pastebin.com/hXhcMJn9)\n\n---\n\n## 🌟 Recommended Settings (LM Studio)\n\n**Chat template:** [pastebin.com/uk9ZkxCR](https://pastebin.com/uk9ZkxCR) (supports tool calling for Zed agent)\n\n| Parameter | Value |\n|-----------|-------|\n| Temperature | 0.7 |\n| Top K Sampling | 20 |\n| Presence Penalty | 1.5 |\n| Top P Sampling | 0.8 |\n| Min P Sampling | 0 |\n| Seed | 42 |\n\n**System prompt:** [pastebin.com/pU25DVnB](https://pastebin.com/pU25DVnB) (solid)  \nOr use this minimal string as the **first line**:\n\n> `You are Qwen, created by Alibaba Cloud. You are a helpful assistant.`\n\nThen add anything you want after. **Model may underperform without this first line.**\n\nAlso you can extend my System Prompt [pastebin.com/pU25DVnB](https://pastebin.com/pU25DVnB) for your own roleplay scenarios. Here how you can do it:\n\nEdit first string. Replace:\n\n> `You are Qwen, created by Alibaba Cloud. You are a helpful assistant.`\n\nWith\n\n> `You are Qwen, created by Alibaba Cloud. You are a helpful assistant. 
You are currently roleplaying as [your text here]`\n\n---\n\n## About\n\nNo changes to datasets or capabilities. Fully functional - 100% of what the original authors intended, just without refusals and with the critical architecture bug fixed on output layers.\n\n**These are meant to be the best lossless uncensored models out there.**\n\n---\n\n## Specs\n\n- 35B total parameters, ~3B active per forward pass (MoE)\n- 256 experts, 8 routed + 1 shared per token\n- Hybrid architecture: Gated DeltaNet linear attention + full softmax attention (3:1 ratio)\n- 40 layers, pattern: 10 × (3 × DeltaNet-MoE + 1 × Attention-MoE)\n- 262K native context (extendable to 1M with YaRN)\n- Natively multimodal (text, image, video)\n- Multi-token prediction (MTP) support\n- 248K vocabulary, 201 languages\n- Based on [Qwen/Qwen3.5-35B-A3B](https://huggingface.co/Qwen/Qwen3.5-35B-A3B)\n\n---\n\n## Recommended Settings (Official Qwen Authors)\n\n**Thinking mode (default):**\n- General: `temperature=1.0, top_p=0.95, top_k=20, min_p=0, presence_penalty=1.5`\n- Coding/precise tasks: `temperature=0.6, top_p=0.95, top_k=20, min_p=0, presence_penalty=0`\n\n**Non-thinking mode:**\n- General: `temperature=0.7, top_p=0.8, top_k=20, min_p=0, presence_penalty=1.5`\n- Reasoning tasks: `temperature=1.0, top_p=1.0, top_k=40, min_p=0, presence_penalty=2.0`\n\n**Important:**\n- Keep at least 128K context to preserve thinking capabilities\n- Use `--jinja` flag with llama.cpp for proper chat template handling\n- Vision support requires the `mmproj` file alongside the main GGUF\n\n---\n\n## Compatibility\n\nWorks with llama.cpp, LM Studio, koboldcpp, and other GGUF-compatible runtimes.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "uncensored",
    "qwen3.5",
    "moe",
    "vision",
    "multimodal",
    "image-text-to-text",
    "en",
    "zh",
    "multilingual",
    "base_model:Qwen/Qwen3.5-35B-A3B",
    "base_model:quantized:Qwen/Qwen3.5-35B-A3B",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 2188,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-16T15:14:15.000Z",
  "created_at": "2026-04-16T08:47:53.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}
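
The README embedded in the metadata above recommends specific sampler values, the `--jinja` flag, at least 128K of context, and the `mmproj` file for vision. The following sketch shows how those could be combined into a llama.cpp server command line. It assumes that a `llama-server` binary is on `PATH`, that both GGUF files are in the working directory, and that the LM Studio values from the README are the ones wanted. `build_llama_server_args` is a hypothetical helper, and the flag names should be checked against the installed llama.cpp build.

```python
import shlex

# File names from the download table on this page.
MODEL = "Qwen3.5-35B-A3B-Uncensored-Wasserstein-Q4_K_L.gguf"
MMPROJ = "mmproj-FernflowerAI-f16.gguf"

# Sampler values from the README's "Recommended Settings (LM Studio)" table.
SAMPLING = {
    "--temp": 0.7,
    "--top-k": 20,
    "--top-p": 0.8,
    "--min-p": 0,
    "--presence-penalty": 1.5,
    "--seed": 42,
}


def build_llama_server_args(model, mmproj=None, ctx_size=131072):
    """Assemble an argv list for llama-server (hypothetical helper).

    ctx_size defaults to 128K tokens, following the README's advice to keep
    at least that much context.
    """
    args = ["llama-server", "-m", model, "--jinja", "--ctx-size", str(ctx_size)]
    if mmproj:
        # The projector file is only needed for image input.
        args += ["--mmproj", mmproj]
    for flag, value in SAMPLING.items():
        args += [flag, str(value)]
    return args


if __name__ == "__main__":
    # Print a shell-quoted command that can be pasted into a terminal.
    print(shlex.join(build_llama_server_args(MODEL, MMPROJ)))
```

Building the command as a list keeps it usable with `subprocess.run` without shell quoting issues. A 128K context may not fit on smaller machines, so `ctx_size` is left as a parameter.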

Source payload excerpt (from Hugging Face API)

{
  "_id": "69e0a23952d270b64fb2bb9d",
  "id": "LuffyTheFox/Qwen3.5-35B-A3B-Uncensored-Wasserstein-GGUF",
  "modelId": "LuffyTheFox/Qwen3.5-35B-A3B-Uncensored-Wasserstein-GGUF",
  "sha": "8a5cb729b040da007534237ac263b22b19e28b15",
  "createdAt": "2026-04-16T08:47:53.000Z",
  "lastModified": "2026-04-16T15:14:15.000Z",
  "author": "LuffyTheFox",
  "downloads": 2188,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 5
}
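
The display fields earlier on this page (Model Slug, Visibility, Access, Created) appear to follow from this payload. The sketch below shows one plausible mapping; it is an assumption about how the index derives them, not GraySoft's actual code. `summarize` is a hypothetical helper, and the "Gated" and "Private" labels for the non-default cases are guesses.

```python
import json

# Payload excerpt copied from this page.
PAYLOAD = json.loads("""
{
  "id": "LuffyTheFox/Qwen3.5-35B-A3B-Uncensored-Wasserstein-GGUF",
  "sha": "8a5cb729b040da007534237ac263b22b19e28b15",
  "createdAt": "2026-04-16T08:47:53.000Z",
  "lastModified": "2026-04-16T15:14:15.000Z",
  "author": "LuffyTheFox",
  "downloads": 2188,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text"
}
""")


def summarize(payload: dict) -> dict:
    """Map raw API fields to display values (assumed mapping, for illustration)."""
    return {
        "slug": payload["id"].lower(),
        "visibility": "Private" if payload["private"] else "Public",
        "access": "Gated" if payload["gated"] else "Open",
        # Keep only the date part of the ISO 8601 timestamps.
        "created": payload["createdAt"][:10],
        "last_modified": payload["lastModified"][:10],
    }


if __name__ == "__main__":
    for key, value in summarize(PAYLOAD).items():
        print(f"{key}: {value}")
```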

More models in this shard