aessedai/qwen3.5-122b-a10b-gguf IQ2_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

aessedai/qwen3.5-122b-a10b-gguf overview

Updates ### 3/10/2026 I've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.

ggufbase_model:Qwen/Qwen3.5-122B-A10Bbase_model:quantized:Qwen/Qwen3.5-122B-A10Bendpoints_compatibleregion:usimatrixconversational

Downloads

5,312

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

18 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen3.5-122B-A10B-IQ2_XXS-00001-of-00002.gguf	GGUF	IQ2_XXS	10.44 MB	Download
Qwen3.5-122B-A10B-IQ2_XXS-00002-of-00002.gguf	GGUF	IQ2_XXS	31.58 GB	Download
Qwen3.5-122B-A10B-IQ3_S-00001-of-00002.gguf	GGUF	IQ3_S	10.44 MB	Download
Qwen3.5-122B-A10B-IQ3_S-00002-of-00002.gguf	GGUF	IQ3_S	43.39 GB	Download
Qwen3.5-122B-A10B-IQ4_XS-00001-of-00003.gguf	GGUF	IQ4_XS	10.44 MB	Download
Qwen3.5-122B-A10B-IQ4_XS-00002-of-00003.gguf	GGUF	IQ4_XS	46.52 GB	Download
Qwen3.5-122B-A10B-IQ4_XS-00003-of-00003.gguf	GGUF	IQ4_XS	9.78 GB	Download
Qwen3.5-122B-A10B-Q4_K_M-00001-of-00003.gguf	GGUF	Q4_K_M	10.44 MB	Download
Qwen3.5-122B-A10B-Q4_K_M-00002-of-00003.gguf	GGUF	Q4_K_M	45.83 GB	Download
Qwen3.5-122B-A10B-Q4_K_M-00003-of-00003.gguf	GGUF	Q4_K_M	25.65 GB	Download
Qwen3.5-122B-A10B-Q5_K_M-00001-of-00003.gguf	GGUF	Q5_K_M	10.44 MB	Download
Qwen3.5-122B-A10B-Q5_K_M-00002-of-00003.gguf	GGUF	Q5_K_M	45.82 GB	Download
Qwen3.5-122B-A10B-Q5_K_M-00003-of-00003.gguf	GGUF	Q5_K_M	39.44 GB	Download
imatrix.gguf	GGUF	—	198.23 MB	Download
mmproj-Qwen3.5-122B-A10B-BF16.gguf	GGUF	BF16	870.00 MB	Download
mmproj-Qwen3.5-122B-A10B-F16.gguf	GGUF	F16	866.63 MB	Download
mmproj-Qwen3.5-122B-A10B-F32.gguf	GGUF	F32	1.68 GB	Download
mmproj-Qwen3.5-122B-A10B-Q8_0.gguf	GGUF	—	590.53 MB	Download

Model Details Live

Model Slug

aessedai/qwen3.5-122b-a10b-gguf

Author

AesSedai

Pipeline Task

—

Library

—

Created

2026-02-24

Last Modified

2026-03-13

Gated

Private

HF SHA

9bf9aae2683153d2920fdbb49526b8ccabf1a5a8

License

Unknown

Language

Unknown

Base Model

Qwen/Qwen3.5-122B-A10B

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": [
      "Qwen/Qwen3.5-122B-A10B"
    ],
    "frontmatter": {
      "base_model": [
        "Qwen/Qwen3.5-122B-A10B"
      ]
    },
    "hero_image_url": "kld_data/01_kld_vs_filesize.png \"Chart showing Pareto KLD analysis of quants\"",
    "summary": "## Updates ### 3/10/2026 I've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- Qwen/Qwen3.5-122B-A10B\n---\n## Updates\n### 3/10/2026\nI've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.\n\n## Description\nThis repo contains specialized MoE-quants for Qwen3.5-122B-A10B. The idea being that given the huge size of the FFN tensors compared to the rest of the tensors in the model, it should be possible to achieve a better quality while keeping the overall size of the entire model smaller compared to a similar naive quantization. To that end, the quantization type default is kept in high quality and the FFN UP + FFN GATE tensors are quanted down along with the FFN DOWN tensors.\n\n| Quant | Size | Mixture | PPL | 1-(Mean PPL(Q)/PPL(base)) | KLD |\n| :--------- | :--------- | :------- | :------- | :------- | :------- |\n| Q5_K_M | 85.26 GiB (6.00 BPW) | Q8_0 / Q5_K / Q5_K / Q6_K | 4.822883 ± 0.028429 | +0.1056% | 0.005545 ± 0.000044 |\n| Q4_K_M | 71.48 GiB (5.03 BPW) | Q8_0 / Q4_K / Q4_K / Q5_K | 4.830384 ± 0.028459 | +0.2613% | 0.010455 ± 0.000084 |\n| IQ4_XS | 56.29 GiB (3.96 BPW) | Q8_0 / IQ3_S / IQ3_S / IQ4_XS | 4.914250 ± 0.028952 | +2.0020% | 0.027787 ± 0.000206 |\n| IQ3_S | 43.39 GiB (3.05 BPW) | Q8_0 / IQ2_S / IQ2_S / IQ3_S | 5.126355 ± 0.030507 | +6.4046% | 0.074562 ± 0.000524 |\n| IQ2_XXS | 31.58 GiB (2.22 BPW) | Q4_K / IQ2_XXS / IQ2_XXS / IQ2_XXS | 5.727638 ± 0.035038 | +18.8850% | 0.185195 ± 0.001112 |\n\n![kld_graph](kld_data/01_kld_vs_filesize.png \"Chart showing Pareto KLD analysis of quants\")\n![ppl_graph](kld_data/02_ppl_vs_filesize.png \"Chart showing Pareto PPL analysis of quants\")",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:Qwen/Qwen3.5-122B-A10B",
    "base_model:quantized:Qwen/Qwen3.5-122B-A10B",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 42,
  "downloads": 5312,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-13T02:49:21.000Z",
  "created_at": "2026-02-24T17:34:49.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "699de139f9f2dd55f075f303",
  "id": "AesSedai/Qwen3.5-122B-A10B-GGUF",
  "modelId": "AesSedai/Qwen3.5-122B-A10B-GGUF",
  "sha": "9bf9aae2683153d2920fdbb49526b8ccabf1a5a8",
  "createdAt": "2026-02-24T17:34:49.000Z",
  "lastModified": "2026-03-13T02:49:21.000Z",
  "author": "AesSedai",
  "downloads": 5312,
  "likes": 42,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 29
}

aessedai/qwen3.5-122b-a10b-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard