GraySoft
Model Intelligence Sheet

volko76/qwen3.5-122b-a10b-ud-iq4_xs-gguf-merged overview

This is a merged copy of the UD-IQ4_XS quantization from unsloth: https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF

I took the UD-IQ4_XS shards (0001.gguf, 0002.gguf, 0003.gguf) and merged them into a single .gguf file using the llama.cpp gguf-split tool: https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split

A single file is useful, for example, with vLLM, which does not accept multi-shard GGUF files.

Feel free to check out my website, https://cheapllm.shop, for unlimited FREE inference of this model during the beta. After the beta, pricing will be $0.02/M input and $0.10/M output, which should make it the cheapest provider by a big margin.

If you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences.
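The merge step described above can be sketched as a small Python wrapper around the llama.cpp gguf-split tool. This is only an illustration, not the author's actual script: it assumes the tool is installed on PATH under the name `llama-gguf-split` (the name used by recent llama.cpp builds) and that all shards sit in the same directory as the first one. The function names and the file paths in the usage note are hypothetical placeholders.

```python
import shutil
import subprocess
from pathlib import Path


def build_merge_command(first_shard: str, merged_output: str,
                        binary: str = "llama-gguf-split") -> list[str]:
    """Return the argv that merges a sharded GGUF, given its first shard.

    The tool locates the remaining shards from the first shard's metadata,
    so only the first shard and the output path are passed.
    """
    return [binary, "--merge", first_shard, merged_output]


def merge_shards(first_shard: str, merged_output: str) -> None:
    """Run the merge, failing early if the input or the tool is missing."""
    if not Path(first_shard).is_file():
        raise FileNotFoundError(first_shard)
    cmd = build_merge_command(first_shard, merged_output)
    if shutil.which(cmd[0]) is None:
        # Assumption: the binary is on PATH; adjust `binary` otherwise.
        raise RuntimeError(f"{cmd[0]} not found on PATH")
    subprocess.run(cmd, check=True)
```

A call would look like `merge_shards("path/to/first-shard.gguf", "merged.gguf")`, with both paths replaced by the real shard and output names.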

Tags: gguf, base_model:Qwen/Qwen3.5-122B-A10B, base_model:quantized:Qwen/Qwen3.5-122B-A10B, endpoints_compatible, region:us, imatrix, conversational
Downloads: 616
Likes: 0
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

1 file detected
Direct downloads for all repository files
File | Type | Quantization | Size | Link
Qwen3.5-122B-A10B-UD-IQ4_XS.gguf | GGUF | IQ4_XS | 56.09 GB | Download
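After downloading a large single-file GGUF like the one listed above, a quick sanity check is to read its fixed-size header. The sketch below assumes the GGUF v2+ layout (4-byte magic `GGUF`, a uint32 version, then uint64 tensor and metadata key/value counts) in little-endian byte order. The function name is hypothetical, and the check does not validate the tensor data itself.

```python
import struct
from typing import BinaryIO

GGUF_MAGIC = b"GGUF"
HEADER_FORMAT = "<4sIQQ"  # magic, version, tensor count, metadata KV count
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)  # 24 bytes


def read_gguf_header(stream: BinaryIO) -> dict:
    """Parse the fixed GGUF header from a binary stream.

    Assumes a little-endian GGUF v2 or later file; v1 used 32-bit counts.
    """
    raw = stream.read(HEADER_SIZE)
    if len(raw) < HEADER_SIZE:
        raise ValueError("file too short to contain a GGUF header")
    magic, version, tensor_count, kv_count = struct.unpack(HEADER_FORMAT, raw)
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file, magic was {magic!r}")
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

To use it on a real file, open the file in binary mode: `with open(path, "rb") as f: print(read_gguf_header(f))`.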

Model Details

Model Slug: volko76/qwen3.5-122b-a10b-ud-iq4_xs-gguf-merged
Author: Volko76
Pipeline Task: (not set)
Library: (not set)
Created: 2026-03-29
Last Modified: 2026-03-31
Gated: No
Private: No
HF SHA: 157147f629a2eadae0b0635e99f449114c3e1041
License: Unknown
Language: Unknown
Base Model: unsloth/Qwen3.5-122B-A10B-GGUF, Qwen/Qwen3.5-122B-A10B

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": [
      "unsloth/Qwen3.5-122B-A10B-GGUF",
      "Qwen/Qwen3.5-122B-A10B"
    ],
    "frontmatter": {
      "base_model": [
        "unsloth/Qwen3.5-122B-A10B-GGUF",
        "Qwen/Qwen3.5-122B-A10B"
      ]
    },
    "hero_image_url": "",
    "summary": "from unsloth : https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF I simply took the UD-IQ4_XS and merged all the shards (0001.gguf, 0002.gguf, 0003.gguf) and merged them into one .gguf I used llama.cpp gguf-split tool to merge : https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split Usefull for example for vLLM because they don't allow multishards Feel free to check out my website : https://cheapllm.shop for unlimited FREE inference of this model (during the beta, after that the pricing will be $0.02/M input and $0.10/M output so the cheapest provider by a big margin) If you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- unsloth/Qwen3.5-122B-A10B-GGUF\n- Qwen/Qwen3.5-122B-A10B\n---\nfrom unsloth : https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF\n\nI simply took the UD-IQ4_XS and merged all the shards (0001.gguf, 0002.gguf, 0003.gguf) and merged them into one .gguf\nI used llama.cpp gguf-split tool to merge : https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split\n\nUsefull for example for vLLM because they don't allow multishards\n\nFeel free to check out my website : https://cheapllm.shop for unlimited FREE inference of this model (during the beta, after that the pricing will be $0.02/M input and $0.10/M output so the cheapest provider by a big margin)\n\nIf you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:Qwen/Qwen3.5-122B-A10B",
    "base_model:quantized:Qwen/Qwen3.5-122B-A10B",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 0,
  "downloads": 616,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-31T15:08:09.000Z",
  "created_at": "2026-03-29T22:25:52.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69c9a6f0d18d4aed320f7e18",
  "id": "Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED",
  "modelId": "Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED",
  "sha": "157147f629a2eadae0b0635e99f449114c3e1041",
  "createdAt": "2026-03-29T22:25:52.000Z",
  "lastModified": "2026-03-31T15:08:09.000Z",
  "author": "Volko76",
  "downloads": 616,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 3
}
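The relationship between the source payload and the normalized metadata shown above can be illustrated with a small mapping function. This is a guess at the shape of the transformation, inferred only from the two JSON excerpts on this page; it is not the site's actual code, and the function names are hypothetical. It covers the scalar fields only and leaves out `card_data` and `tags`, which come from the model card rather than from the excerpt.

```python
def model_slug(payload: dict) -> str:
    """Lowercase the Hugging Face model id to get the page's slug form."""
    return payload["id"].lower()


def normalize_payload(payload: dict) -> dict:
    """Map camelCase API fields to the snake_case keys used in metadata_json."""
    return {
        "likes": payload.get("likes", 0),
        "downloads": payload.get("downloads", 0),
        "gated": payload.get("gated", False),
        "private": payload.get("private", False),
        "last_modified": payload.get("lastModified", ""),
        "created_at": payload.get("createdAt", ""),
        # Empty strings are kept as-is; the page shows these fields as blank.
        "pipeline_tag": payload.get("pipeline_tag", ""),
        "library_name": payload.get("library_name", ""),
    }
```

Applied to the excerpt above, `model_slug` yields the slug shown under Model Details, and `normalize_payload` reproduces the scalar fields of the normalized metadata.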