k0ndra/qwen3.5-35b-a3b-heretic-v2-ja-imatrix-gguf - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

k0ndra/qwen3.5-35b-a3b-heretic-v2-ja-imatrix-gguf overview

日本語を主体としたImportance MatrixによるGGUF量子化です。 Japanese-focused imatrix GGUF quantizations of llmfan46/Qwen3.5-35B-A3B-heretic-v2.

ggufimatrixjapaneseqwen3moeabliteratedtext-generationjaenbase_model:llmfan46/Qwen3.5-35B-A3B-heretic-v2base_model:quantized:llmfan46/Qwen3.5-35B-A3B-heretic-v2license:apache-2.0endpoints_compatibleregion:usconversational

k0ndra/qwen3.5-35b-a3b-heretic-v2-ja-imatrix-gguf visual

Downloads

6,181

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

4 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen3.5-35B-A3B-heretic-v2_IQ3_XXS.gguf	GGUF	IQ3_XXS	12.69 GB	Download
Qwen3.5-35B-A3B-heretic-v2_IQ4_XS.gguf	GGUF	IQ4_XS	17.44 GB	Download
Qwen3.5-35B-A3B-heretic-v2_Q4_K_M.gguf	GGUF	Q4_K_M	19.71 GB	Download
mmproj-Qwen3.5-35B-A3B-heretic-v2-BF16.gguf	GGUF	BF16	861.00 MB	Download

Model Details Live

Model Slug

k0ndra/qwen3.5-35b-a3b-heretic-v2-ja-imatrix-gguf

Author

k0ndra

Pipeline Task

text-generation

Library

—

Created

2026-03-27

Last Modified

2026-04-03

Gated

Private

HF SHA

5d7d9792b71d31e4206395ffd9fb45cc1fa2b928

License

Unknown

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": "llmfan46/Qwen3.5-35B-A3B-heretic-v2",
    "base_model_relation": "quantized",
    "quantized_by": "K0ndra",
    "pipeline_tag": "text-generation",
    "tags": [
      "gguf",
      "imatrix",
      "japanese",
      "qwen3",
      "moe",
      "abliterated"
    ],
    "language": [
      "ja",
      "en"
    ],
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "日本語を主体としたImportance MatrixによるGGUF量子化です。 Japanese-focused imatrix GGUF quantizations of llmfan46/Qwen3.5-35B-A3B-heretic-v2.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\r\nlicense: apache-2.0\r\nbase_model: llmfan46/Qwen3.5-35B-A3B-heretic-v2\r\nbase_model_relation: quantized\r\nquantized_by: K0ndra\r\npipeline_tag: text-generation\r\ntags:\r\n  - gguf\r\n  - imatrix\r\n  - japanese\r\n  - qwen3\r\n  - moe\r\n  - abliterated\r\nlanguage:\r\n  - ja\r\n  - en\r\n---\r\n\r\n# Qwen3.5-35B-A3B-heretic-v2 Japanese imatrix GGUF\r\n\r\n日本語を主体としたImportance MatrixによるGGUF量子化です。\r\n\r\nJapanese-focused imatrix GGUF quantizations of [llmfan46/Qwen3.5-35B-A3B-heretic-v2](https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-heretic-v2).\r\n\r\n## 量子化情報\r\n\r\n- **元モデル:** [llmfan46/Qwen3.5-35B-A3B-heretic-v2](https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-heretic-v2)\r\n- **ベースモデル:** [Qwen/Qwen3.5-35B-A3B](https://huggingface.co/Qwen/Qwen3.5-35B-A3B) (Apache 2.0)\r\n- **量子化ツール:** [llama.cpp](https://github.com/ggml-org/llama.cpp) `b8559`\r\n\r\n## imatrixについて\r\n\r\n日本語テキストを主体としたキャリブレーションデータでImportance Matrixを生成しています。\r\n\r\n（おそらく）英語データをメインにで生成されたimatrixと比較して、低ビット量子化（IQ3/IQ4クラス）において日本語の生成品質をより良く維持することを期待していましたが、no thinkingでの選択式や抽出型などの日本語ベンチマークでは微妙な結果でした。 **[llmfan46/Qwen3.5-35B-A3B-heretic-v2-GGUF](https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-heretic-v2-GGUF)を使用することを推奨します。**Q6_K以上ではimatrixによる差異は小さいと思うので、本リポジトリでは低ビット量子化に絞って公開しています。\r\n\r\nimatrixデータファイル(`Qwen3.5-35B-A3B-heretic-v2.imatrix`)を同梱しているため、他の量子化タイプを生成したい場合にご利用いただけます。\r\n\r\n## ⚠️ 注意 / Disclaimer\r\n\r\nこのモデルは検閲除去処理が施されたモデルの量子化です。安全フィルターが大幅に緩和されており、有害・不適切なコンテンツを生成する可能性があります。出力内容の利用については利用者自身の責任においてご判断ください。\r\n\r\nThis is a quantization of an abliterated model with significantly reduced safety filters. Use at your own risk and responsibility.\r\n\r\n## クレジット\r\n\r\n- **元モデル:** [llmfan46](https://huggingface.co/llmfan46)\r\n- **ベースモデル:** [Qwen Team](https://huggingface.co/Qwen)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "imatrix",
    "japanese",
    "qwen3",
    "moe",
    "abliterated",
    "text-generation",
    "ja",
    "en",
    "base_model:llmfan46/Qwen3.5-35B-A3B-heretic-v2",
    "base_model:quantized:llmfan46/Qwen3.5-35B-A3B-heretic-v2",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 2,
  "downloads": 6181,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-03T13:58:41.000Z",
  "created_at": "2026-03-27T23:33:12.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69c713b8664affbf554efccd",
  "id": "k0ndra/Qwen3.5-35B-A3B-heretic-v2-ja-imatrix-GGUF",
  "modelId": "k0ndra/Qwen3.5-35B-A3B-heretic-v2-ja-imatrix-GGUF",
  "sha": "5d7d9792b71d31e4206395ffd9fb45cc1fa2b928",
  "createdAt": "2026-03-27T23:33:12.000Z",
  "lastModified": "2026-04-03T13:58:41.000Z",
  "author": "k0ndra",
  "downloads": 6181,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 7
}

k0ndra/qwen3.5-35b-a3b-heretic-v2-ja-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard