GraySoft
Projects Models About FAQ Contact Download guIDE →

aessedai/qwen3.5-122b-a10b-gguf IQ2_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

aessedai/qwen3.5-122b-a10b-gguf overview

Updates ### 3/10/2026 I've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.

ggufbase_model:Qwen/Qwen3.5-122B-A10Bbase_model:quantized:Qwen/Qwen3.5-122B-A10Bendpoints_compatibleregion:usimatrixconversational
aessedai/qwen3.5-122b-a10b-gguf visual
Downloads
5,312
Likes
42
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

18 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Qwen3.5-122B-A10B-IQ2_XXS-00001-of-00002.gguf GGUF IQ2_XXS 10.44 MB Download
Qwen3.5-122B-A10B-IQ2_XXS-00002-of-00002.gguf GGUF IQ2_XXS 31.58 GB Download
Qwen3.5-122B-A10B-IQ3_S-00001-of-00002.gguf GGUF IQ3_S 10.44 MB Download
Qwen3.5-122B-A10B-IQ3_S-00002-of-00002.gguf GGUF IQ3_S 43.39 GB Download
Qwen3.5-122B-A10B-IQ4_XS-00001-of-00003.gguf GGUF IQ4_XS 10.44 MB Download
Qwen3.5-122B-A10B-IQ4_XS-00002-of-00003.gguf GGUF IQ4_XS 46.52 GB Download
Qwen3.5-122B-A10B-IQ4_XS-00003-of-00003.gguf GGUF IQ4_XS 9.78 GB Download
Qwen3.5-122B-A10B-Q4_K_M-00001-of-00003.gguf GGUF Q4_K_M 10.44 MB Download
Qwen3.5-122B-A10B-Q4_K_M-00002-of-00003.gguf GGUF Q4_K_M 45.83 GB Download
Qwen3.5-122B-A10B-Q4_K_M-00003-of-00003.gguf GGUF Q4_K_M 25.65 GB Download
Qwen3.5-122B-A10B-Q5_K_M-00001-of-00003.gguf GGUF Q5_K_M 10.44 MB Download
Qwen3.5-122B-A10B-Q5_K_M-00002-of-00003.gguf GGUF Q5_K_M 45.82 GB Download
Qwen3.5-122B-A10B-Q5_K_M-00003-of-00003.gguf GGUF Q5_K_M 39.44 GB Download
imatrix.gguf GGUF 198.23 MB Download
mmproj-Qwen3.5-122B-A10B-BF16.gguf GGUF BF16 870.00 MB Download
mmproj-Qwen3.5-122B-A10B-F16.gguf GGUF F16 866.63 MB Download
mmproj-Qwen3.5-122B-A10B-F32.gguf GGUF F32 1.68 GB Download
mmproj-Qwen3.5-122B-A10B-Q8_0.gguf GGUF 590.53 MB Download

Model Details Live

Model Slug
aessedai/qwen3.5-122b-a10b-gguf
Author
AesSedai
Pipeline Task
Library
Created
2026-02-24
Last Modified
2026-03-13
Gated
No
Private
No
HF SHA
9bf9aae2683153d2920fdbb49526b8ccabf1a5a8
License
Unknown
Language
Unknown
Base Model
Qwen/Qwen3.5-122B-A10B

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": [
      "Qwen/Qwen3.5-122B-A10B"
    ],
    "frontmatter": {
      "base_model": [
        "Qwen/Qwen3.5-122B-A10B"
      ]
    },
    "hero_image_url": "kld_data/01_kld_vs_filesize.png \"Chart showing Pareto KLD analysis of quants\"",
    "summary": "## Updates ### 3/10/2026 I've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- Qwen/Qwen3.5-122B-A10B\n---\n## Updates\n### 3/10/2026\nI've uploaded new quants using the new fused Up + Gate conversion, this offers up to a +10% boost in prompt processing speed from my testing.\n\n## Description\nThis repo contains specialized MoE-quants for Qwen3.5-122B-A10B. The idea being that given the huge size of the FFN tensors compared to the rest of the tensors in the model, it should be possible to achieve a better quality while keeping the overall size of the entire model smaller compared to a similar naive quantization. To that end, the quantization type default is kept in high quality and the FFN UP + FFN GATE tensors are quanted down along with the FFN DOWN tensors.\n\n| Quant | Size | Mixture | PPL | 1-(Mean PPL(Q)/PPL(base)) | KLD |\n| :--------- | :--------- | :------- | :------- | :------- | :------- |\n| Q5_K_M | 85.26 GiB (6.00 BPW) | Q8_0 / Q5_K / Q5_K / Q6_K | 4.822883 ± 0.028429 | +0.1056% | 0.005545 ± 0.000044 |\n| Q4_K_M | 71.48 GiB (5.03 BPW) | Q8_0 / Q4_K / Q4_K / Q5_K | 4.830384 ± 0.028459 | +0.2613% | 0.010455 ± 0.000084 |\n| IQ4_XS | 56.29 GiB (3.96 BPW) | Q8_0 / IQ3_S / IQ3_S / IQ4_XS | 4.914250 ± 0.028952 | +2.0020% | 0.027787 ± 0.000206 |\n| IQ3_S | 43.39 GiB (3.05 BPW) | Q8_0 / IQ2_S / IQ2_S / IQ3_S | 5.126355 ± 0.030507 | +6.4046% | 0.074562 ± 0.000524 |\n| IQ2_XXS | 31.58 GiB (2.22 BPW) | Q4_K / IQ2_XXS / IQ2_XXS / IQ2_XXS | 5.727638 ± 0.035038 | +18.8850% | 0.185195 ± 0.001112 |\n\n![kld_graph](kld_data/01_kld_vs_filesize.png \"Chart showing Pareto KLD analysis of quants\")\n![ppl_graph](kld_data/02_ppl_vs_filesize.png \"Chart showing Pareto PPL analysis of quants\")",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:Qwen/Qwen3.5-122B-A10B",
    "base_model:quantized:Qwen/Qwen3.5-122B-A10B",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 42,
  "downloads": 5312,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-13T02:49:21.000Z",
  "created_at": "2026-02-24T17:34:49.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "699de139f9f2dd55f075f303",
  "id": "AesSedai/Qwen3.5-122B-A10B-GGUF",
  "modelId": "AesSedai/Qwen3.5-122B-A10B-GGUF",
  "sha": "9bf9aae2683153d2920fdbb49526b8ccabf1a5a8",
  "createdAt": "2026-02-24T17:34:49.000Z",
  "lastModified": "2026-03-13T02:49:21.000Z",
  "author": "AesSedai",
  "downloads": 5312,
  "likes": 42,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 29
}